the ASME 


Nonlinear Phenomena . . . C. A. Ludeke


Response . . 
J Taft, and R. Schuman 


A Dual-Mode Damper-Stabilized Servo . . . . . . « J. Jursih, J. F. Kaiser, and J. E. Ward 


Amplitude Noise om Googe Van 


of Nonlinear of Two Variables on Analog Equipment 


Basic Methods for Nonlinear Control-System Analysis . . « 


. Doll and T. M. Stout 


Physical and Mathematical Mechanisms of Instability in Nonlinear Automatic Control Systems 


Hunting Due to Lost Motion . . . Poritsky


Compensation of « Velocity Servomechanism With Backlash ‘ 
Lag C.N. Shen, H. A. Miller, and N. B. Nichols 


Flow Through Annular Orifices . . . Bell and O. P. Bergelin
How RF Concerns the Wood Industry . . . J. W. Mann
on Mamie Angie. S. M. De Corse and G. A. Kemeny 


ing Blade Rows. . . 


Operating Experience and Design Features of Closed-Cycle Gas-Turbine Power Plants . . . Curt Keller


. . . J. B. Esgar, J. N. B. Livingood, and R. O. Hickel


Generalised Optimal Heat-Exchanger Design . . . D. H. Fax and R. Mills, Jr.


ove - Robert Siegel and R. H. Norris 


A Mechanical Computing Device for the Analysis of One-Dimensional, Transient, Heat-Conduction 
Problems .. W. E. Howland, E. A. Trabant, and G. A. Hawkins 


A en cana of iiieieas When Used in Va Compression Cycles Over an Extended Tem- 
perature Range. . . Barger, W. M. Robsenow, and K. M. Treadwell 


of Complex Geometry to Relative Velocities and Accelerations in Mechanisms
. . . G. H. Martin and M. F. Spotts


TRANSACTIONS OF THE AMERICAN SOCIETY OF MECHANICAL ENGINEERS 


VOLUME 79 APRIL 1957 NUMBER 3 




Transactions 


of The American Society of Mechanical Engineers 
Published on the tenth of every month, except March, June, September, and December 


OFFICERS OF THE SOCIETY:
W. F. Ryan, President
Joseph L. Kopf, Treasurer    C. E. Davies, Secretary
Edgar J. Kates, Asst. Treasurer


COMMITTEE ON PUBLICATIONS: 


W. E. Reaser, Chairman 
Kare ATKINSON B. G. A. Sxnorzx: 


Jou vz S. N. Biacxmon 
} Advisory Members 
A. Stetson, Editor Emeritus    J. A. North, Production


J. J. Jn., Editor 


REGIONAL ADVISORY BOARD OF THE PUBLICATIONS COMMITTEE: 


Sante of Mechanical Engineers. Publication office at 20th and N: fiogteemense Easton, Pa. 
department the the Society, 29 West ‘Ninth Stree, New York 18, N. Y. Mecbanece,” New 
$1 a copy, 00 annually for Transactions and the Journal of Applied jon. Chan es of mas a copy, $6.00 annually, Add $1.50 for 
postage to all countries outside the United States, Saaede. and Pan American — x Be received at Society headquarters 
seven weeks eepayed they are to be effective on the a. Please send old as well as new a The Society shall not be responsible 


ad 
Easton, Co hted, 1957, by The American 


Roy L. H. M. 
Guewn R. C. R. 
F. J. Hemze—lll M. B. Hooan—VII 
Francis C. Surra—IV Hetanpsr—VIII 


Nonlinear Phenomena 


By C. A. LUDEKE,¹ CINCINNATI, OHIO


It is the purpose of this paper to present briefly some 
phenomena associated with nonlinear systems. The phe- 
nomena discussed are wave form, frequency dependence 
on amplitude, the jump phenomenon, subharmonic os- 
cillations, limit cycles, and frequency entrainment. Ex- 
perimental examples of such phenomena are included and 
a representative bibliography is given. 


INTRODUCTION 


For the past 15 years it has been the author's pleasure to
work in the field of nonlinear mechanics. Although so
named (1),² this field treats any phenomena, mechanical,
electrical, hydrodynamical, and so on, which can be described by
nonlinear differential equations. Since most physical phenomena,
when accurately represented, require nonlinear differential
equations, the task at first glance seems to be to find solutions which
are in better agreement with reality and which avoid the
idealization of having been linearized. A simple step into the field
encourages this belief. For example, consider the equations


$$m\ddot{x} + c\dot{x} + kx = F\cos\omega t \qquad [1]$$
$$m\ddot{x} + c\dot{x} + \alpha x + \beta x^{3} = F\cos\omega t \qquad [2]$$


Fig. 1  Analog Solution of Equation [2] With βx³ ≪ αx

The first equation is linear and we know the solution to be
sinusoidal; the second equation is quasilinear if βx³ ≪ αx, represents
many systems more accurately than Equation [1], and has an
almost sinusoidal solution as shown in Fig. 1 (2). Perhaps our task
is "merely" to get better and better solutions by means of higher
and higher approximations. This never seemed very exciting to
the author and fortunately this is not the case. His suspicions
were aroused by Equation [3]


$$m\ddot{x} + c\dot{x} + \beta x^{3} = F\cos\omega t \qquad [3]$$


which is the same as Equation [2] with α = 0. Two things now
confound the issue: if you were hoping to use quasilinear methods,
you now have no linear equation with which to begin; and if you
use an analog and expect a radical change between the solutions
of Equations [1], [2], and [3], because the mathematics dictates
a radical change in procedure, you are again in difficulty, as shown
in Fig. 2 (3). So although all of the physics must be in the
solutions, a casual look at them does not tell the story. Perhaps,
therefore, we should begin to examine phenomena which show marked
contrasts when occurring in linear and nonlinear systems.

Fig. 2  Analog Solution of Equation [3]

¹ Department of Physics, Graduate School of Arts and Sciences,
University of Cincinnati.

² Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at
the ASME-AIEE Conference on Nonlinear Control Systems,
Princeton, N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
10, 1956. Paper No. 56-IRD-7.


FREQUENCY-AMPLITUDE DEPENDENCE 
The simplest and most familiar of these is the frequency-amplitude
dependence which exists in oscillations occurring in nonlinear
systems. Consider systems represented by

$$m\ddot{x} + c\dot{x} + kx = 0 \qquad [4]$$
$$m\ddot{x} + c\dot{x} + \alpha x + \beta x^{3} = 0 \qquad [5]$$


Both of these equations represent damped oscillations. Equation
[4] is linear, and as the amplitude of the oscillation decreases the
frequency remains unchanged. Equation [5] is nonlinear, and
Fig. 3 (4), which gives an analog solution for β > 0, shows that the
frequency decreases as the amplitude decreases. Fig. 4 shows the
frequency-amplitude relationships for β equal to, less than, or greater
than zero. This is the simplest nonlinear characteristic to detect
and a graph of the form shown in Fig. 4 tells you not only that
nonlinearity is present but also gives an indication of its magnitude.

Fig. 3  Solution of Equation [5] With β > 0
(Note dependence of period on amplitude.)

Fig. 4  Frequency-Amplitude Relationship of Various Solutions of Equation [5]
(Free vibrations, linear and nonlinear.)
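The frequency-amplitude dependence can be checked numerically. The following is a minimal sketch, not the author's analog procedure: it assumes a cubic restoring force αx + βx³, omits the damping (c = 0) so that the period is well defined, and uses illustrative parameter values. The function name `free_frequency` is hypothetical.

```python
import math


def free_frequency(amplitude, alpha=1.0, beta=0.5, m=1.0, h=1e-3):
    """Angular frequency of the undamped free oscillation
    m*x'' + alpha*x + beta*x**3 = 0, released from rest at x = amplitude > 0.
    All parameter values are illustrative assumptions."""

    def accel(x):
        return -(alpha * x + beta * x ** 3) / m

    def rk4(x, v):
        # One classical Runge-Kutta step for x' = v, v' = accel(x).
        k1x, k1v = v, accel(x)
        k2x, k2v = v + 0.5 * h * k1v, accel(x + 0.5 * h * k1x)
        k3x, k3v = v + 0.5 * h * k2v, accel(x + 0.5 * h * k2x)
        k4x, k4v = v + h * k3v, accel(x + h * k3x)
        x_new = x + h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        v_new = v + h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        return x_new, v_new

    t, x, v = 0.0, amplitude, 0.0
    for _ in range(1_000_000):  # hard cap so the loop always terminates
        x_new, v_new = rk4(x, v)
        if v < 0.0 <= v_new:
            # Velocity returns to zero at the far turning point: half a period.
            t_half = t + h * v / (v - v_new)
            return math.pi / t_half
        t, x, v = t + h, x_new, v_new
    raise RuntimeError("no turning point found; motion may be unbounded")


if __name__ == "__main__":
    for beta in (0.5, 0.0, -0.1):
        print(beta, [round(free_frequency(a, beta=beta), 4) for a in (0.2, 1.0, 2.0)])
```

With β = 0 the computed frequency does not change with amplitude; with β > 0 it rises with amplitude, and with β < 0 it falls, which is the family of curves sketched in Fig. 4.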


JUMP PHENOMENON


If we examine the frequency-amplitude relationship in
nonlinear forced oscillations we discover a very striking example of
nonlinear phenomena. This is the so-called "jump" phenomenon.
Consider once again Equation [1]; it is linear, and the well-known
resonance results are shown in Fig. 5. A similar set of results for
Equation [2], which is nonlinear, is shown in Fig. 6 (5).
Immediately we note that the "resonance curves" are discontinuous
and follow different paths for increasing and decreasing frequencies.
This is emphasized in Fig. 7.
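One way to see why the curves become discontinuous is the standard first-approximation (harmonic-balance) amplitude relation for a cubic restoring force, [(α − mω² + ¾βA²)² + (cω)²]A² = F². This relation is a textbook approximation swapped in for illustration, not the analog procedure used for Fig. 6, and the sketch below assumes a constant F (Fig. 6 takes F proportional to ω²) and illustrative parameter values. Where three amplitudes satisfy the relation at one frequency, the response must jump between branches as the frequency is swept. The function name `response_amplitudes` is hypothetical.

```python
def response_amplitudes(omega, m=1.0, c=0.1, alpha=1.0, beta=0.5, F=0.5,
                        a_max=10.0, n=4000):
    """Positive amplitudes A satisfying the first-approximation relation
    ((alpha - m*omega**2 + 0.75*beta*A**2)**2 + (c*omega)**2) * A**2 = F**2.
    Roots are bracketed on a uniform grid and refined by bisection."""

    def g(a):
        detune = alpha - m * omega ** 2 + 0.75 * beta * a * a
        return (detune ** 2 + (c * omega) ** 2) * a * a - F * F

    roots = []
    step = a_max / n
    lo, g_lo = 0.0, g(0.0)
    for i in range(1, n + 1):
        hi = i * step
        g_hi = g(hi)
        if g_lo * g_hi < 0.0:  # sign change: one root inside [lo, hi]
            a, b, g_a = lo, hi, g_lo
            for _ in range(60):
                mid = 0.5 * (a + b)
                g_mid = g(mid)
                if g_a * g_mid <= 0.0:
                    b = mid
                else:
                    a, g_a = mid, g_mid
            roots.append(0.5 * (a + b))
        lo, g_lo = hi, g_hi
    return roots


if __name__ == "__main__":
    for omega in (0.5, 1.6, 2.5):
        print(omega, [round(a, 3) for a in response_amplitudes(omega)])
```

For these assumed values a single amplitude exists well below and well above resonance, while three coexist in between; the largest and smallest are the branches followed on increasing and decreasing frequency sweeps.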


Fig. 5  Frequency-Amplitude Relationship for Solution of Equation [1]
(Forced linear vibrations.)

Fig. 6  Frequency-Amplitude Relationship for Solution of Equation [2] With F Proportional to ω²
(Forced nonlinear vibrations.)

Fig. 7  Jump Phenomena in Frequency-Amplitude Relationships of Forced Nonlinear Vibrations


Not only is Fig. 6 completely different from Fig. 5, but it is also
not the only one possible. In considering Equation [1] there is
no question about the fact that the solution must have the same
frequency as the forcing function. However,
there is no reason to believe that this also must be true for
Equation [2] (6, 7). Actually, solutions exist whose frequencies are
multiples or submultiples of the forcing frequency (8). Thus
Equation [2] has superharmonic and subharmonic solutions as
well as the standard harmonic response. For subharmonics this
means that an equation of the form

$$m\ddot{x} + c\dot{x} + f(x) = F\cos\omega t \qquad [6]$$

will have as one of its solutions

$$x = A_{1/n}\cos(\omega t/n) + \cdots + A_{1}\cos\omega t + \cdots$$

Thus the frequency of the fundamental will be a submultiple of
the frequency of the forcing function. Specific results from an
analog are shown in Fig. 8 (9). A device for demonstrating
subharmonic oscillations qualitatively is shown in Fig. 9 (10). We
are now in a field of investigation which, with the exception of
subharmonics of order one half, is entirely a field of nonlinear
phenomena. We can find [for example, see Fig. 10 (11)] that
there is a frequency-amplitude relationship for subharmonic
solutions. From an actual analog record displayed in Fig. 11 (12)
we note that transitions may occur between subharmonics of
different orders, and that these transitions are associated with a
subharmonic jump phenomenon. An entire field is now opened
and it is obvious many questions are yet to be answered. Since
this is the author's own particular field of interest, he must be on
guard against talking too much about it. Let's move on to
another nonlinear phenomenon, that of self-excited oscillations.

Fig. 8  Wave Forms
(Upper record shows a subharmonic of order one fifth; lower, a subharmonic
of order one third.)

Fig. 9  Device for Demonstrating Subharmonic Vibrations

Fig. 10  Frequency-Amplitude Relationships for Subharmonic Oscillations Maintained
in a Nonlinear System by a Rotating Unbalance
(Vertical jumps take place between the heavy solid lines. Axes: amplitude versus frequency of unbalance in cycles/sec.)

Fig. 11  Transitions
(At A the jump for harmonic oscillation takes place, and as the frequency increases we have a subharmonic of order 1/2 at B. This subharmonic has its
own jump at C, and settles into a 1/3 subharmonic at D.)


SELF-EXCITED OSCILLATIONS
Consider an equation of the form (13)


$$m\ddot{x} + (\text{nonlinear damping term}) + kx = 0$$


This equation is nonlinear in the damping term and has no term
corresponding to a forcing function. An examination of the
nonlinear damping term indicates that for small values of x the
damping will be negative and will actually put energy into the system
while for large values of x it is positive and removes energy from
the system. Thus we are not too surprised that such a system
may have an oscillatory solution; and since it is not a forced
system we call these oscillations self-excited oscillations.

A system capable of self-excited oscillations is shown in Fig. 12,
and the resulting solutions are shown in Fig. 13. Perhaps the
most interesting aspect of this nonlinear phenomenon is
associated with the final amplitude of oscillation, the so-called limit
cycle. This limit cycle depends only on the parameters of the
system, not on the initial conditions. This is quite contrary to
linear theory. Usually the limit cycles are plotted as closed
trajectories in the phase plane as shown in Fig. 14. A common
mechanism exhibiting a limit cycle is an ordinary clock. This cannot be
considered an example of a self-excited oscillation because it is an
oscillatory-damped system excited by shocks twice per period
from an external source of energy. This energy, released by the
escapement, replenishes the energy lost per half cycle and closes
the phase trajectories which then become limit cycles. If the
clock is wound from rest it is immaterial whether a small or large
impulse is used to start it; the final operation is independent of
the initial conditions.

Fig. 12  A Mechanical System Capable of Causing Mass M to Exhibit Self-Excited Oscillations Because of Nonlinearity
of Dry Friction Between Mass and Moving Belt

Fig. 13  Type of Oscillation Which Might Be Generated by System Shown in Fig. 12
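The independence of the limit cycle from the initial conditions can be illustrated numerically. The sketch below assumes the classical van der Pol damping term, so that the equation reads m ẍ − ε(1 − x²)ẋ + kx = 0; this matches the description of damping that is negative for small x and positive for large x, but the symbol ε and all parameter values are assumptions made for illustration. The function name `limit_cycle_amplitude` is hypothetical.

```python
def limit_cycle_amplitude(x0, eps=1.0, m=1.0, k=1.0, h=0.01,
                          t_settle=60.0, t_measure=20.0):
    """Peak |x| of m*x'' - eps*(1 - x**2)*x' + k*x = 0 after transients decay,
    starting from x = x0 at rest. The van der Pol damping form and every
    parameter value are assumptions for illustration."""

    def accel(x, v):
        return (eps * (1.0 - x * x) * v - k * x) / m

    def rk4(x, v):
        # One classical Runge-Kutta step for x' = v, v' = accel(x, v).
        k1x, k1v = v, accel(x, v)
        k2x, k2v = v + 0.5 * h * k1v, accel(x + 0.5 * h * k1x, v + 0.5 * h * k1v)
        k3x, k3v = v + 0.5 * h * k2v, accel(x + 0.5 * h * k2x, v + 0.5 * h * k2v)
        k4x, k4v = v + h * k3v, accel(x + h * k3x, v + h * k3v)
        x_new = x + h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        v_new = v + h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        return x_new, v_new

    x, v = x0, 0.0
    for _ in range(int(round(t_settle / h))):  # let the transient die out
        x, v = rk4(x, v)
    peak = 0.0
    for _ in range(int(round(t_measure / h))):  # several full cycles
        x, v = rk4(x, v)
        peak = max(peak, abs(x))
    return peak


if __name__ == "__main__":
    print(limit_cycle_amplitude(0.05), limit_cycle_amplitude(4.0))
```

A start well inside the closed trajectory and a start well outside it settle to the same final amplitude, which is the behavior plotted in the phase plane of Fig. 14.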


ENTRAINMENT OF FREQUENCY 


If a periodic force of frequency ω is applied to an oscillating
system of frequency ω₀, one observes the well-known phenomenon
of beats. As the difference between the two frequencies decreases,
the beat frequency also decreases. In a truly linear system, using
linear theory, we find that the beat frequency should decrease
indefinitely as ω − ω₀ → 0. In actual practice, however, most
systems are not truly linear and we find experimentally that the
oscillator frequency ω₀ falls in synchronism with, or is entrained
by, the external frequency ω within a certain band of frequencies.
This phenomenon is called entrainment of frequency and the band
of frequency in which entrainment occurs is called the zone of
entrainment. The zone of entrainment is indicated in Fig. 15 by the
region Δω, which is the region where ω and ω₀ coalesce and only
one frequency ω exists. If the theory were linear, the relationship
between |ω − ω₀| and ω would follow the dotted lines and
|ω − ω₀| would be zero for only one value of ω = ω₀. In Fig. 16
we see a system capable of demonstrating frequency entrainment,
and in Fig. 17 we see the data obtained from this system in
operation.
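The shape of Fig. 15 can be reproduced with a much simpler model than the pendulum of Fig. 16. The sketch below swaps in Adler's phase equation, dφ/dt = (ω − ω₀) − K sin φ, a standard simplified description of an oscillator pulled by an external signal; φ is the phase difference between drive and oscillator, and the locking strength K is a hypothetical parameter not taken from the paper. The long-time average of dφ/dt plays the role of the beat frequency. The function name `mean_beat_frequency` is hypothetical.

```python
import math


def mean_beat_frequency(detuning, lock_strength, h=0.02, t_end=1000.0):
    """Long-time average of |d(phi)/dt| for the phase model
    phi' = detuning - lock_strength * sin(phi), starting from phi = 0.
    detuning stands for (omega - omega_0); lock_strength is an assumed
    coupling constant K."""

    def rate(phi):
        return detuning - lock_strength * math.sin(phi)

    phi = 0.0
    steps = int(round(t_end / h))
    for _ in range(steps):
        # Classical Runge-Kutta step for the scalar phase equation.
        k1 = rate(phi)
        k2 = rate(phi + 0.5 * h * k1)
        k3 = rate(phi + 0.5 * h * k2)
        k4 = rate(phi + h * k3)
        phi += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0
    return abs(phi) / (steps * h)


if __name__ == "__main__":
    for d in (0.02, 0.05, 0.09, 0.2, 0.5):
        print(d, round(mean_beat_frequency(d, 0.1), 4))
```

With K = 0 the beat frequency equals the detuning, the dotted straight lines of the linear theory. With K > 0 the beats cease throughout the band |ω − ω₀| < K, and outside it the beat frequency approaches √((ω − ω₀)² − K²).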

CONCLUSION 


It has been a privilege to discuss some of the phenomena 
associated with nonlinear systems. There is not too much known 
about any of these phenomena and each could serve as an excellent 
field of research, both pure and applied. 


Fig. 14  Typical Phase-Plane Representation for Self-Excited Oscillations

Fig. 15  Zone of Entrainment

Fig. 16  Froude Pendulum Driven by a Periodic Force
(Under proper conditions such a system may be used to demonstrate
frequency entrainment.)

Fig. 17  Typical Time-Displacement Curve for Mechanism Shown in Fig. 16
(Cessation of beats indicates entrainment.)


BIBLIOGRAPHY 


1 Perhaps named after the work entitled "Introduction to
Nonlinear Mechanics," by N. Kryloff and N. Bogoliuboff, Kiev, USSR,
1937. A free translation of extracts of this work was given by S.
Lefschetz, Princeton University Press, Princeton, N. J., 1943.

"Theory of Oscillations," by A. Andronow and S. Chaikin, Moscow,
USSR, 1937.

"Introduction to Nonlinear Mechanics," by N. Minorsky, J. W.
Edwards, Ann Arbor, Mich., 1947. This book contains an
extensive list of references.

"Nonlinear Vibrations," by J. J. Stoker, Interscience Publishers
Inc., New York, N. Y., 1950.

"Ordinary Nonlinear Differential Equations," by N. W.
McLachlan, Oxford University Press, London, England, 1950.

"Forced Oscillations in Nonlinear Systems," by C. Hayashi, Nippon
Printing and Publishing Company, Osaka, Japan, 1953.

2 "An Experimental Investigation of Forced Vibrations in a
Mechanical System Having a Nonlinear Restoring Force," by C. A.
Ludeke, Journal of Applied Physics, vol. 17, 1946, pp. 603-609.

3 "Analog Computer Elements for Solving Nonlinear
Differential Equations," by C. A. Ludeke and C. L. Morrison, Journal of
Applied Physics, vol. 24, 1953, pp. 243-248.

4 "An Electro-Mechanical Device for Solving Nonlinear
Differential Equations," by C. A. Ludeke, Journal of Applied Physics,
vol. 20, 1949, pp. 600-607.

5 "Predominantly Subharmonic Oscillations," by C. A. Ludeke,
Journal of Applied Physics, vol. 22, 1951, pp. 1321-1326.

6 "The Generation and Extinction of Subharmonics," by C. A.
Ludeke, reprinted from Proceedings of the Symposium on Nonlinear
Circuit Analysis, Polytechnic Institute of Brooklyn, Brooklyn, N. Y.,
1953.

7 "Subharmonic Oscillations in Nonlinear Systems," by C.
Hayashi, Journal of Applied Physics, vol. 24, 1953, pp. 521-529.

8 "Harmonic, Superharmonic, and Subharmonic Response for
Single Degree of Freedom Systems of the Duffing Type," by J. C.
Burgess, Technical Report No. 27, Contract N6-ONR-251 Task Order
2 (NR-041-943), Stanford University, Stanford, Calif., 1954.

9 Same as (5).

10 "A Mechanical Model for Demonstrating Subharmonic
Resonance," by C. A. Ludeke, American Journal of Physics, vol. 16, 1948,
pp. 430-434.

11 Same as (5).

12 Same as (5).

13 "Relaxation Oscillations," by B. Van der Pol, Philosophical
Magazine, vol. 2, November, 1926.


Discussion 


C. HAYASHI.³ Subharmonic oscillations of order 1/2
may occur in nonlinear systems and also in linear systems when
system parameters change periodically with time. The device
shown in Fig. 9 of the paper has a nonlinear characteristic
provided by the guiding support of the pendulum. It also has a
time-varying characteristic given by the rotating mass m around the
motor. It is rather difficult to see which one of these
characteristics (nonlinear or time-varying) gives rise to the subharmonic
response. If the guiding surface of the support is removed and
still subharmonic oscillations of order 1/2 could be obtained, one
could conclude that the subharmonic response is due to the
time-varying characteristic of the pendulum.

³ Department of Mechanical Engineering, Massachusetts Institute
of Technology, Cambridge, Mass.

Generally speaking, subharmonic oscillations of order 1/2 in
nonlinear systems are apt to occur when f(x) in Equation [6]
is a nonodd function of x, for example, when f(x) is given by

$$f(x) = c_{1}x + c_{2}x^{2} + c_{3}x^{3}$$


However, even when the nonlinearity is symmetrical, i.e., when
f(x) is an odd function of x, subharmonic oscillations of order
1/2 may occur. In this case the nonlinearity is symmetrical,
but the oscillation itself becomes unsymmetrical and self-biased.
In other words, the oscillation contains a unidirectional
component. Further, subharmonic oscillations of order 1/2 with
different amplitudes and wave forms may occur in the same
system under the action of an equal periodic force. This is
evidently due to the nonlinearity of the system and cannot be
explained by the time-varying characteristic.


K. KLOTTER.⁴ The author has given a survey of a number of
nonlinear phenomena and, with his often displayed skill, has
demonstrated some of them by well-performed experiments.

Among the experiments shown at the Conference (not recorded,
however, in the preprints of the paper) was the pendulum with
oscillating support. As the author has mentioned, it is well
known⁵ that such a pendulum may oscillate at a frequency which
is half the frequency of the support, and it is equally well known
that the phenomenon can be described by a Mathieu differential
equation of form


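$$\frac{d^{2}\varphi}{dz^{2}} + \left(\frac{g}{l\Omega^{2}} + \frac{U}{l}\cos z\right)\varphi = 0 \qquad [9]$$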


where U and Ω denote amplitude and frequency of the support,
l is the equivalent length of the pendulum, and z = Ωt.
Equation [9] is a linear differential equation with nonconstant
(sinusoidally varying) coefficients.
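That a linear equation with a periodically varying coefficient can produce an oscillation at half the support frequency is easily checked numerically. The sketch below integrates the Mathieu equation in the generic form φ″ + (δ + ε cos z)φ = 0, where the notation δ = g/(lΩ²) and ε = U/l is an assumption made here for illustration, as are the numerical values. At δ = 1/4 the support frequency is twice the natural frequency of the pendulum, and the motion grows; away from that value it stays bounded. The function name `mathieu_peak` is hypothetical.

```python
import math


def mathieu_peak(delta, eps=0.2, z_end=100.0, h=0.01):
    """Largest |phi| reached by phi'' + (delta + eps*cos(z))*phi = 0 on
    0 <= z <= z_end, starting from phi = 1, phi' = 0. The symbols delta and
    eps and their values are assumptions for illustration."""

    def accel(z, phi):
        return -(delta + eps * math.cos(z)) * phi

    z, p, v = 0.0, 1.0, 0.0
    peak = abs(p)
    for _ in range(int(round(z_end / h))):
        # Classical Runge-Kutta step for phi' = v, v' = accel(z, phi).
        k1p, k1v = v, accel(z, p)
        k2p, k2v = v + 0.5 * h * k1v, accel(z + 0.5 * h, p + 0.5 * h * k1p)
        k3p, k3v = v + 0.5 * h * k2v, accel(z + 0.5 * h, p + 0.5 * h * k2p)
        k4p, k4v = v + h * k3v, accel(z + h, p + h * k3p)
        p += h * (k1p + 2 * k2p + 2 * k3p + k4p) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        z += h
        peak = max(peak, abs(p))
    return peak


if __name__ == "__main__":
    print("delta = 0.25:", mathieu_peak(0.25))  # support at twice the natural frequency
    print("delta = 0.60:", mathieu_peak(0.60))  # detuned from the 1/2 resonance
```

In the growing case the response has period 4π in z, that is, half the frequency of the support, although no nonlinear term is present.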

The author then went on to demonstrate very clearly
subharmonics of order 1/2, 1/3, 1/5, 1/9, etc., by means of the
device shown in Fig. 9 of the paper. The subharmonics in this
experiment were all attributed to the nonlinearity in the restoring
force of that device.

The writer has some doubts as to the correctness of this ex- 
planation in regard to the subharmonics of order 1/2 (or of any 
order 1/2n). Certainly, the experiment, as described and per- 
formed leaves room for other explanations: Because the exciting 
force is provided by the inertia force of a single rotating mass, 
it has not only a horizontal component (as used for explaining 
the results of the experiment) but also a vertical one. This 
sinusoidally varying vertical component of the inertia force produces
a periodically varying effective stiffness of the spring;⁵ an
equation of motion of the Mathieu type will follow and hence the 
situation is quite analogous to the one of the pendulum with 
oscillating support. The subharmonics observed may be solu- 
tions of a (linear) Mathieu differential equation, not solutions of 
a nonlinear differential equation. 

The writer attributes quite a bit of significance to this situation, 
because in so far as he is aware, the existence of subharmonics of 
order 1/2 (or of order 1/2n generally) in systems having non- 
linear restoring forces which are point-symmetrical (represented 
by purely odd functions of the displacement) has not yet been 
proved or disproved beyond doubt. Hence it would seem highly 
desirable to refine the experiments shown by the author in two 
different ways: 


1 The experiment of Fig. 9 should be carried out with a linear
spring (omitting the constraining walls and using very small
displacements). If the subharmonics of order 1/2 still readily
appear (as this writer surmises) the explanation by means of the
linear Mathieu equation is the only one applicable.

2 In the nonlinear device of Fig. 9 an excitation should be
produced by means of a purely horizontal force (sinusoidally
varying). This could be accomplished by the use of two masses
rotating clockwise and counterclockwise, respectively, thus
destroying their vertical components. Depending on whether
or not a subharmonic of order 1/2 will appear in this case one
would have an indication about the existence of subharmonics of
order 1/2 in systems having odd restoring forces.

⁴ Professor of Engineering Mechanics, Stanford University,
Stanford, Calif. Mem. ASME.

⁵ "Stabilisierung und Labilisierung durch Schwingungen," by K.
Klotter, Forschung, vol. 12, September-October, 1941, p. 209.


The writer would like to urge the author strongly to devote 
his experimental skills to finding clear-cut answers to this still 
rather beclouded problem of subharmonics of order 1/2. 


AUTHOR'S CLOSURE


Drs. Hayashi and Klotter both indicate in their discussions
that the subharmonic of order one half may arise in systems
which have nonlinearities, in systems which have time-varying
characteristics, and in systems which have both. Since the
system consisting of a rotating unbalance mounted on top of a
vertical cantilever has both characteristics, it is difficult to
decide which gives rise to the subharmonic.

Both writers suggest an experiment in which the nonlinearity 
is removed, but not the time-varying characteristic; and Dr. 
Klotter suggests also the possibility of removing the time- 
varying characteristics while retaining the nonlinearity. The 
author is in full agreement with these suggestions, and as soon 
as time permits he will perform these experiments and any 
others which suggest themselves in order to locate uniquely the 
characteristic which gives rise to subharmonic oscillations of 
order one half. 

The author wishes to thank the discussers for many kind 
words of encouragement and many fine discussions while at the 
Princeton meeting. 


A Résumé of the Development and Literature 
of Nonlinear Control-System Theory 


By T. J. HIGGINS,¹ MADISON, WIS.


"Es gibt nichts Praktischeres als eine gute Theorie
(There is nothing more practical than a good theory)"
Boltzmann.

The first section of this paper stresses the fact that
nonlinear control-system theory is rooted in the theory
developed earlier for the solution of other types of nonlinear
systems, especially those of nonlinear mechanics and
nonlinear electric circuits; whence the control engineer
interested in nonlinear systems should gain a thorough
knowledge of nonlinear system theory in the large. To
facilitate such study this paper advances a concise résumé
of the major stages of development of nonlinear system
theory to date; a list of papers which encompasses a more
detailed account of development; a list of the principal
books on general theory; and a list of sources on special
aspects of the general theory, each item being
characterized as to its particular merit. The principal methods of
nonlinear analysis and the phenomena of particular
interest in nonlinear systems are noted. Finally, the
possible use of these methods and the actual occurrence of
these phenomena in nonlinear control-system analysis are
emphasized.


I  THE ROOTS OF NONLINEAR CONTROL THEORY


The theory and methods pertinent to the analysis of
present-day automatic control systems are essentially the same as
the theory and methods developed for the analysis of
nonlinear problems encountered in certain domains of nonlinear
mechanics and nonlinear electric circuits. In such connection,
much of this theory and many of the methods have long been
studied, highly developed, and well used for the solution of
numerous different kinds of problems. Direct study of most of the
some two hundred papers published to date on nonlinear control
theory reveals:


1 That essentially all of the theory and methods used to effect
analysis of the operating performance of the systems considered
in these papers are paralleled in papers published earlier in the
other mentioned domains. For example, the basic tenets of the
much-employed method of "describing functions" are essentially
the same as for Kryloff's method of first approximation, used
by him some 20 years ago for the study and solution of various
nonlinear electrical and mechanical problems.

2 That a great deal of theory and numerous powerful methods 
developed for the solution of problems in nonlinear mechanics 
and nonlinear electric circuits have not yet been employed for 
nonlinear control-systems analysis—although such use could 
prove most fruitful. 

In such state, it would seem that the control engineer who is 


¹ Department of Electrical Engineering, University of Wisconsin.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at the
ASME-AIEE Conference on Nonlinear Control Systems,
Princeton, N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters,
January 12, 1956. Paper No. 56-IRD-4.


interested in obtaining a thorough grounding in theory and meth- 
ods, which will afford him knowledge both of the causes of various 
puzzling (to many workers) phenomena occurring in nonlinear 
control systems and of means of analysis of such systems, could 
hardly do better than to study and master the theory and meth- 
ods that have been developed to date in the remarked fields of 
nonlinear mechanics and nonlinear electric circuits. Such would 
not prove especially difficult; for, as evidenced in the following: 


1 The essential literature is quite circumscribed. 

2 Much of the essential content is readily available in English. 

3 In most of this content, emphasis is on basic theory, with 
application illustrated by rather simple physical systems; 
whence no considerable knowledge of a specific branch of tech- 
nology is required for ready assimilation. 


II  THE TIME STREAM OF NONLINEAR THEORY


A desirable first step in undertaking such a study would be to 
gain perspective of the domain of nonlinear (dynamic) theory as 
a whole, both in time and in content. 

The time stream is clearly marked and consideration thereof 
tends to a natural division into several stages of progress which 
may be characterized broadly as follows: The first stage en- 
compasses the initial work, in the eighteenth and much of the 
nineteenth centuries, on various nonlinear problems whereof 
solutions could be effected in a specific sense, either in closed 
form (as in determination of the large oscillations of a pendulum 
in terms of elliptic functions) or—more usually—in an approxi- 
mate form, by linearizing the nonlinear equations and obtaining 
therewith approximations valid over limited ranges of the in- 
dependent variables (as in the so-called small oscillations of a 
pendulum). 

A second stage comprises the period of, roughly, 1880-1920.
On one side it encompasses such powerful and generally applicable
developments as Poincaré's topological research (1881-1882),
leading to the concept and theory of limit cycles; Lindstedt's successful
solution (1883) of the secularization difficulty which had baffled
Poisson and others in their efforts to effect successful series
expansions in periodic terms; Poincaré's work (1892) in effecting
analytic solution of a system by means of series expansions in
terms of powers of certain numerically small parameters
characterizing the nonlinearity; Liapounoff's work (1892) on stability
theory; and Bendixson's investigations (1901) on limit cycles.
On the other side, this second stage embraces development of a
number of rather specialized procedures enabling approximate
solution of special problems, as typified, say, in Martienssen's
paper (1910) and in Duffing's book (1918). In the large, this
work was concerned with study of problems in nonlinear
mechanics and effort, in general, was largely concentrated in this area.

In the third stage, however, from about 1920 to 1940, the
most rapid advance in the theory and methods of nonlinear
analysis, and in insight into, and understanding of, nonlinear
phenomena, stemmed from focus of attention on various phases of
nonlinear electric-circuit analysis. Thus this period opened with
van der Pol's analysis (1920) of the triode oscillator by his
specially developed method. From this work originated a great
volume of subsequent associated investigation on electron-tube
devices. This includes (to cite some of the major workers
whose names are yet familiar) that of Appleton and Greaves in
England; Liénard, Le Corbeiller, and Rocard in France; van der
Pol and van der Mark in The Netherlands; and in the later
1930's and early 1940's, under the stimulus of military
communications research, Cartwright and Littlewood in England
and Levinson in the United States.

In the later 1920’s and through the 1930's, however, the greater 
volume of work was carried out in Russia. In the large, two 
groups of workers are to be distinguished. One comprised 
Mandelstam, Papalexi, Andronow, Chaikin, Witt, and others, 
centered in Moscow and Leningrad, whose work is largely rooted 
in Poincaré’s analytic and topologic procedures and is supported 
and complemented by a considerable amount of experimental 
work. The other comprised Kryloff, Bogoliuboff, and their
pupils, at Kiev, whose major work lies in their development of
certain linearization procedures and in utilization of Lindstedt's
analysis, as evidenced by use of these to solve numerous problems
in nonlinear mechanics and nonlinear electric circuits.

Although during this stage major attention focused on
electron-tube devices, work also was done on nonlinear problems concerned
with rotating electric machines (as evidenced in the work of
Lyon and Edgerton on pull-out torques of synchronous machines)
and on static circuits containing a nonlinear element such as an
arc or ferromagnetic inductor (as indicated in Pedersen's work
on subharmonic generation in iron-cored circuits).

However, though attention largely centered on problems of non- 
linear electric circuits, some work was done on mechanical sys- 
tems, for example, in vibrating systems encompassing nonlinear 
springs. Also in this period appears the first work—in a modern 
sense—on nonlinear control systems as exemplified in Minor- 
sky’s early work (1922) on ship steering and in Hazen’s later 
considerations (1934) of relay-type servosystems. In considera- 
ble measure publication of Hazen’s paper may be said to have 
initiated the current era of automatic control-system theory. 
In the large, however, this era really got under way about 1940, 
under the press of military necessity. 

Thus, in the fourth stage of the development of nonlinear
system theory, roughly 1940-1950, considerable attention
centered on nonlinear aspects of control-system analysis. Mainly,
the earlier efforts were bent in two directions: to the analysis of
systems which are nonlinear by virtue of the deliberate inclusion
of a major nonlinear element, as exemplified by the switching
action in relay (off-on) servosystems, and to the analysis of
systems which are essentially linear, but have undesired nonlinear
aspects such as dead time, backlash, elastic hysteresis, saturation,
and so on, of a “small” nature. This stage encompasses the
work of (to name only some of the best-known non-Russian 
workers) MacColl, Oldenbourg and Sartorius, Oppelt, Tustin, 
Kochenburger, Dutilh, Loeb, and Cahen. In Russia, a very 
considerable volume of work was accomplished by Malkin and 
many others. 

The fifth and present stage of work originated about 1950:
on one side, through the start of a considerable effort directed
toward improving the performance of control systems by the
deliberate inclusion of nonlinear elements and effects (as in
MacDonald’s work on multiple-mode switching); on the other side,
through attempts to study inherent nonlinear effects in control
systems in the “large,” rather than just in the small, where
linearization yields workable solutions.


III Orientation


Now, as detailed in Section IV, the major nonlinear phenomena
and methods of analysis observed or employed to date in the
fourth and fifth stages of the time stream are paralleled by, or
rooted in, those observed or developed in the first three stages;
whence the importance to the control engineer interested in nonlinear
systems of gaining a good knowledge pertinent to the first three 
stages. A good orientation in the broad outlines of this work, 
through delineation of the major phenomena occurring in non- 
linear systems and by means of illustrative physical example 
without for the most part getting into any considerable mathe- 
matics, can be obtained by reading a certain group of papers. 
These are limited in number, easily obtained, highly readable, 
and provide a good background for embarking on a detailed 
study of the theory. 

The principal papers of this group, and their particular values, 
may be mentioned as follows: 


1 A comprehensive paper by van der Pol (1)² (1934), which
gives an excellent chronologically based résumé of the main 
course of development of nonlinear electric-circuit analysis to 
that time, primarily as it concerns oscillation problems in radio 
engineering. 

2 A long paper by Mandelstam (2) and others (1935), which 
outlines the general course of development of nonlinear theory to 
that time, as developed in Russia by the Moscow-Leningrad 
group, and as it concerned oscillation problems in engineering. 

3 A lengthy paper by von Karman (3) (1940) which, though 
concerned largely with the development of nonlinear static 
mechanics, yet contains some account of nonlinear oscillatory 
problems. 

4 Two well-detailed papers by Cartwright (4, 5) (1949, 
1952). The first contains a good outline of the course of de-
velopment of the basic analytic theory and the discovery of the 
major nonlinear phenomena. The second contains an interesting 
résumé of the paralleling advances in the United States, England, 
and Russia in the late 1930’s and early 1940's. 

5 An interesting paper by Bothwell (6) (1952) on the “present”
state of nonlinear stability theory.

6 A paper by Kestin and Zaremba (7) (1952) which provides a 
good introductory survey of topological analysis and nicely 
complements item 5. 

7 A paper by Weber (8) (1953) of interest in its account of 
certain nonlinear problems of electric-circuit analysis, aside from 
those of radio engineering. 

8 Minorsky’s recent account (9) (1954) of Poincaré’s influence 
on the development of nonlinear theory, as evidenced in the work 
of certain of the principal investigators to date. 

9 Finally, note may be made of certain shorter papers by 
van der Pol (10) (1948), Cartwright (11, 12) (1948, 1951), and 
Haacke (13) (1953) which complement the foregoing major 
items, and of the survey encompassed in the introduction to 
McLachlan’s book, mentioned later. 


IV Detailed Accounts of General Theory


With background thus gained on the course of development to 
date, the engineer interested in obtaining a firm grounding in 
nonlinear control theory should next turn to study of several 
(and subsequently all—if time permits) of the half-dozen prin- 
cipal collective treatments on nonlinear analysis published to 
date in English. As a guide to such study, the following brief 
characterization may be of interest: 


1 N. Kryloff and N. Bogoliuboff (14) (1943). This is a free 
translation and condensation of material from two monographs by 
these authors in the 1930’s. It provides a good introduction to 
both their methods based on Lindstedt’s analysis and to certain
procedures of linearization based on consideration of the physical 
aspects of a system. 

2 A. Andronow and S. Chaikin (15) (1949). This is a free
translation and condensation of the 1937 Russian edition. It
provides an introduction to topological procedures, to van der
Pol’s and Poincaré’s methods, to solution by piecewise linearization,
to graphical solution by isoclines, and to other aspects of the
general theory.

² Numbers in parentheses refer to the Bibliography at the end of
the paper.

3 N. Minorsky (16) (1947). This is essentially a reissue in 
one volume of four 1944-1945 David Taylor Model Basin Re- 
ports. Primarily, this text comprises a connected account of 
Russian work as evidenced in items 1 and 2. It encompasses 
much material omitted in these English translations and con- 
densations, but it also omits some contained therein (especially 
in item 1). The text is supported throughout by discussion of 
numerous illustrative examples. A second, considerably ex- 
panded, edition is now in preparation. A good résumé of much 
of the work discussed in this book is provided in Minorsky’s (17)
interesting paper, “Modern Trends in Nonlinear Mechanics”
(1948); see also Bellin’s (18) paper, “Non-Autonomous Systems”
(1953).

4 N. W. McLachlan (19) (1955). This enlarged second
edition of the original 1950 text provides a good mathematical 
introduction to numerous analytic procedures. 

5 J. J. Stoker (20) (1950). A good introductory text, shorter
and less detailed in analysis than is item 4. 

6 C. Hayashi (21) (1953). This text is devoted largely to 
exposition of forced oscillations (whereas the previous items are 
largely concerned with free oscillations) and encompasses accounts 
of corresponding experimental investigation and substantiation of 
various phases of the theory. 


V Special Aspects of the General Theory


Various special aspects of the general body of nonlinear theory 
are entailed in readily available published books and collections 
of notes. These may prove of interest and aid as noted: 


1 H. Poincaré (22) (1892). Poincaré’s own account of his 
method of small parameters is yet one of the best. 

2 A. Liapounoff (23) (1947). The 1947 photoreproduction 
of the 1907 corrected French translation of the original 1892 
work comprises a detailed account of one of the central features 
of nonlinear-oscillation theory, namely, Liapounoff’s stability 
criteria. In this connection, note Section III—5. 

3 C. Duffing (24) (1918). This book comprises a detailed 
study of what is now known as Duffing’s equation. 

4 P. Le Corbeiller (25) (1931). This contains a descriptive 
account of numerous nonlinear phenomena and of physical 
systems in which they occur. 

5 W. Hurewicz (26) (1943). These mimeographed notes 
provide an excellent introduction to certain topological aspects 
of the solutions of nonlinear problems.

6 Friedrichs, Stoker, Le Corbeiller, and Levinson (27) 
(1942-1943) and Eckweiler, Flanders, Stoker, Friedrichs, and 
John (28) (1946). Much of the essential content of these
mimeographed notes, of courses given, respectively, at Brown and at
New York University, is encompassed in Stoker’s (20) book.

7 L. Pipes (29) (1946). In this book, chapter 22, “The
Analysis of Nonlinear Oscillatory Systems,” contains a good
rephrasing of some aspects of Kryloff-Bogoliuboff’s theory of 
higher approximation in terms of Laplace transform theory. 

8 A. Madwed (30) (1950). This recently reprinted report 
provides the best available general account of the number- 
series method of solution, initiated largely by Tustin. 

9 I. Flügge-Lotz (31) (1953). This book stems from reports
written in Germany during the last war in connection with 
design of control systems for guided missiles. It provides a 
good connected account of study of forced relay-type servo- 
systems by phase-space methods. 




10 R. Bellman (32) (1953). This text contains an interesting 
account of certain aspects of the stability of nonlinear systems, 
as in chapter 4 on Poincaré’s and Liapounoff’s work. It en- 
compasses much of an earlier research report of essentially the 
same title. 


VI Principal Methods of Nonlinear Analysis


The books of Section IV on general theory and the comple- 
mentary specialized items of Section V provide, at best, only a 
basic introduction to some of the better-known and most-used 
methods of nonlinear analysis. After study of these texts, one 
who desires to know more of the advanced phases of the theory 
and uses of these methods, and of a considerable body of other 
theory and numerous useful other methods, would next turn to
the periodical literature. This is very extensive, indeed. If, 
however, attention is directed to that which is of especial value 
in regard to nonlinear control-systems work, a considerable limita- 
tion results. In fact, most of the consequential work is en- 
compassed in a few hundred titles. 

Lack of space forestalls discussion of even this selected list. 
It must suffice to remark that over the past several years the 
writer has compiled a rather exhaustive bibliography of the major 
published work on nonlinear analysis, as applicable for investiga- 
tion of engineering problems; has read the greater portion of this 
body of work; and is preparing a classified listing of those papers 
which cover theory and methods of analysis of interest in non- 
linear control-system work. 

However, indicative of the very considerable number and 
variety of such methods, many of which have not yet been 
employed in nonlinear control-system analysis, it is of value to 
enumerate here the principal available procedures (analytic, 
topologic, graphic, numeric, and computer-effected). Thus: 


1 Solution of the equations of performance in closed form in 
terms of known functions, such as elliptic or hyperelliptic func- 
tions. 

2 Sectional linearization (yielding linear differential equa- 
tions of performance holding over stated ranges of the independ- 
ent variables). 

3 Equivalent linearization, as effected by energy-balance, 
etc. 

4 Expansion in powers of one or more parameters charac- 
terizing small nonlinearities, as in Poincaré’s method. 

5 Kryloff and Bogoliuboff’s methods of first approximation,
improved first approximation, and higher approximation as
based on Lindstedt’s procedure.

6 Van der Pol’s method (which has much in common with
the K-B method of first approximation, the latter being couched
in variable polar co-ordinates, whereas van der Pol’s is couched in
variable rectangular co-ordinates).

7 Minorsky’s stroboscopic method. 

8 Series expansion in powers of the input function. 

9 Inversion of series. 

10 Fourier expansions (for investigating steady-state
oscillations).

11 Asymptotic-series expansions.

12 Various methods of rather limited usefulness, such as
Duffing’s method, one-term approximations to a solution, and so
on.


13 Use of nonlinear integral equations. 
14 Transform theory, enabling a simpler effectance of the 
analysis by the methods of items 5, 9, and 13. 

15 Various variational procedures, especially Galerkin’s 
method and Ritz’s method. 
16 Various iterative procedures. 
17 Number-series method. 




18 Various numerical procedures, especially as formulated by 
Collatz. 

19 Various aspects of phase-plane and phase-space analysis 
(topological procedures). 

20 Numerous methods of graphical construction in the phase 
plane and phase space: Isoclines, Liénard’s, arc-segment pro- 
cedures, and so on. 

21 Graphical constructions of rather limited usefulness, such
as Rauscher’s or Martienssen’s methods.

22 Miscellaneous procedures: Variation of parameters,
successive integrations, Picard’s method of successive
approximations, method of collocation, and so on.

23 Analog-computer and digital-computer solution. 

24 Direct modeling in miniature. 
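
As an illustration of how items 5, 19, and 23 of the foregoing list relate to one another, the following Python sketch (an editorial illustration, not part of the original paper) integrates van der Pol's equation x'' - mu (1 - x^2) x' + x = 0 on a digital computer and compares the steady amplitude with the closed-form solution of the first-approximation amplitude equation da/dt = (mu a / 2)(1 - a^2 / 4). The value mu = 0.2, the step size, the run length, and all function names are assumptions chosen only for illustration.

```python
import math


def van_der_pol(state, mu):
    """Right-hand side of x'' - mu (1 - x^2) x' + x = 0 as a first-order system."""
    x, v = state
    return (v, mu * (1.0 - x * x) * v - x)


def rk4_step(f, state, dt, mu):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(state, mu)
    k2 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)), mu)
    k3 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)), mu)
    k4 = f(tuple(s + dt * k for s, k in zip(state, k3)), mu)
    return tuple(
        s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def steady_amplitude(x0, mu=0.2, dt=0.01, t_end=100.0, tail=20.0):
    """Largest |x| observed over the last `tail` time units of the run."""
    state = (x0, 0.0)
    steps = int(round(t_end / dt))
    tail_start = steps - int(round(tail / dt))
    peak = 0.0
    for i in range(steps):
        state = rk4_step(van_der_pol, state, dt, mu)
        if i >= tail_start:
            peak = max(peak, abs(state[0]))
    return peak


def first_approximation_amplitude(a0, mu, t):
    """Closed-form solution of da/dt = (mu a / 2)(1 - a^2 / 4) with a(0) = a0."""
    return 2.0 / math.sqrt(1.0 + (4.0 / (a0 * a0) - 1.0) * math.exp(-mu * t))


if __name__ == "__main__":
    for x0 in (0.1, 4.0):
        print(x0, steady_amplitude(x0), first_approximation_amplitude(x0, 0.2, 100.0))
```

Starting inside or outside the limit cycle, both the numerical trajectory and the first-approximation formula settle to an amplitude close to 2, which is the kind of agreement one would hope for when mu is small.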


VII Phenomena of Particular Interest in Nonlinear Systems


Detailed study of the body of work discussed in Sections V and 
VI evidences that the major considerations in design or analysis 
of nonlinear systems are largely encompassed under one (or 
more) of the following: 


1 Determination of free or forced transient response of a 
system with stated parameters and prescribed initial conditions. 

2 Determination of free or forced steady-state response of a 
system with stated parameters and prescribed initial conditions, 
and, when a response is oscillatory, how the amplitude,
frequency, period, and nature of this cyclic response depend on the
stated parameters and initial conditions.

3 Determination of bounds on the amplitude of steady- 
state cyclic response. 

4 Determination of the nature of the stability of steady-state 
cyclic response. 

5 Establishment of conditions enabling knowledge of the 
degree of accuracy to be expected from application of various 
procedures for effecting approximate determinations of steady- 
state, free or forced cyclic response, as by Martienssen’s or 
Schwesinger’s methods. 

6 Determination of subharmonic and multiple-harmonic 
steady-state cyclic responses, particularly if these dominate the 
fundamental in amplitude. 

7 Determination of interaction effects between two or more 
elements in a nonlinear system. 

8 Determination of jump phenomena, characterized by 
sudden increase in the value of a system response, expressed as a 
function of a parameter or an input function. 

9 Determination of various resonance phenomena, charac- 
terized by unusually large amplitudes of response stemming from 
certain critical combinations of parameters and/or initial condi- 
tions. 

10 Investigation of the causes, and means of causing or 
obviating, autoexcitation, parametric excitation, and other such 
effects. 
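
The jump phenomenon of item 8 can be made concrete with a small numerical sketch. The following Python fragment (an editorial illustration under stated assumptions, not taken from the paper) applies a one-term Fourier approximation x = A cos(omega t - phi) to Duffing's equation x'' + c x' + x + eps x^3 = F cos(omega t), which gives the amplitude relation A^2 [(1 - omega^2 + (3/4) eps A^2)^2 + (c omega)^2] = F^2. It then counts the real amplitudes satisfying this relation at a given driving frequency. The parameter values c = 0.1, eps = 1, F = 0.5 and the function names are hypothetical choices for illustration.

```python
def residual(u, omega, c=0.1, eps=1.0, force=0.5):
    """Amplitude relation written as a cubic in u = A**2; roots are steady states."""
    detune = 1.0 - omega ** 2 + 0.75 * eps * u
    return u * (detune ** 2 + (c * omega) ** 2) - force ** 2


def amplitudes(omega, u_max=25.0, n_grid=25000, **params):
    """All amplitudes A in (0, sqrt(u_max)] satisfying the one-term relation.

    Roots are bracketed by sign changes on a uniform grid in u and then
    refined by bisection.
    """
    roots = []
    du = u_max / n_grid
    u_lo = 0.0
    g_lo = residual(u_lo, omega, **params)
    for i in range(1, n_grid + 1):
        u_hi = i * du
        g_hi = residual(u_hi, omega, **params)
        if (g_lo < 0.0) != (g_hi < 0.0):  # sign change brackets a root
            a, b, g_a = u_lo, u_hi, g_lo
            for _ in range(60):
                m = 0.5 * (a + b)
                g_m = residual(m, omega, **params)
                if (g_m < 0.0) == (g_a < 0.0):
                    a, g_a = m, g_m
                else:
                    b = m
            roots.append((0.5 * (a + b)) ** 0.5)
        u_lo, g_lo = u_hi, g_hi
    return roots


if __name__ == "__main__":
    for omega in (0.8, 1.5):
        print(omega, amplitudes(omega))
```

With these assumed values a single amplitude is found at omega = 0.8, whereas three are found at omega = 1.5. In the usual interpretation the middle branch is unstable, so that as the frequency is varied slowly through the ends of the multivalued range the response jumps from one outer branch to the other.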


VIII Automatic Control Systems


In substantiation of the values of the foregoing program for 
obtaining a good grounding in the theory and methods of non- 
linear analysis, it is to be remarked that a detailed study of the 
literature published to date on nonlinear control systems reveals: 


1 That the various phenomena studied therein are
encompassed among those just listed in Section VII.

2 That essentially all of the analysis utilized for solution or 
study of specific nonlinear control problems is rooted in, or 
paralleled by, methods stated in Section VI. 


This body of published literature on nonlinear control systems 
is not prohibitively large. It comprises rather sketchy treatment
in several general texts in English (the most comprehensive is,
perhaps, that in Truxal’s (33) book); Flügge-Lotz’s (31) text;
several Russian books (detailed in the author’s (34) recent 
paper); and a body of several hundred periodical papers. Ac- 
cordingly, it is not difficult for the engineer especially interested 
in nonlinear control systems to master what has been published. 
As an aid to such effort, the author has prepared a classified 
listing of much of the consequential, non-Russian periodical 
literature on nonlinear control-system theory published to date. 
A copy may be obtained upon request. 


BIBLIOGRAPHY 


1 “The Non-Linear Theory of Electric Oscillations,” by B. van
der Pol, Proc. Inst. Radio Eng., vol. 22, 1934, pp. 1051-1086.

2 “Exposé des recherches récentes sur les oscillations
non-linéaires” (Account of Recent Researches on Nonlinear Oscillations),
by L. Mandelstam, N. Papalexi, A. Andronow, S. Chaikin, and A.
Witt, Tech. Phys., USSR, vol. 2, 1935, pp. 81-134.

3 “The Engineer Grapples With Nonlinear Problems,” by T. von
Karman, Bull. Am. Math. Society, vol. 46, 1940, pp. 615-683.

4 “Non-Linear Vibrations,” by M. L. Cartwright, Adv. Sci.
(British Asso. Adv. Sci.), April, 1949, pp. 1-12.

5 “Non-Linear Vibrations: A Chapter in Mathematical History,”
by M. L. Cartwright, Math. Gaz., vol. 36, 1952, pp. 81-88.

6 “The Current Status of Dynamic Stability Theory,” by F. E.
Bothwell, Trans. AIEE, vol. 71, part 1, 1952, pp. 223-228.

7 “Geometrical Methods in the Analysis of Ordinary Differential
Equations: Introduction to Nonlinear Mechanics,” by J. Kestin
and S. K. Zaremba, Applied Science Research, series B, vol. 3, 1952,
pp. 149-189.

8 “Introduction to Nonlinear Physical Phenomena,” by E.
Weber, Proc. Sym. Nonlinear Circuit Analysis, New York, N. Y.,
1953, pp. 1-27.

9 “Influence d’Henri Poincaré sur l’évolution moderne de la
théorie des oscillations nonlinéaires” (Influence of Henri Poincaré on
the Modern Evolution of the Theory of Nonlinear Oscillations),
by N. Minorsky, Mem. Soc. Ingrs. Civ. Fr., vol. 107, no. 111, July-
September, 1954, pp. 108-205.

10 “Mathematics and Radio Problems,” by B. van der Pol,
Philips Res. Report 3, 1948, pp. 174-190.

11 “Topological Aspects of Forced Oscillations,” by M. L.
Cartwright, Research, vol. 1, 1948, pp. 601-606.

12 “Non-Linear Vibrations,” by M. L. Cartwright, Journal Pi Mu
Epsilon, April, 1951, pp. 131-137.

13 “Über die nichtlineare Mechanik” (On Nonlinear Mechanics),
by W. Haacke, Phys. Blätter, vol. 9, no. 9, 1953, pp. 398-405.

14 “Introduction to Non-Linear Mechanics,” by N. Kryloff and
N. Bogoliuboff, tr. by S. Lefschetz, Princeton University Press,
Princeton, N. J., 1943, 105 pp.

15 “Theory of Oscillations,” by A. Andronow and S. Chaikin,
tr. by N. Goldowski, Princeton University Press, Princeton, N. J.,
1949, 358 pp.

16 “Introduction to Nonlinear Mechanics,” by N. Minorsky,
J. W. Edwards, Ann Arbor, Mich., 1947, 464 pp., second edition
pending publication.

17 “Modern Trends in Nonlinear Mechanics,” by N. Minorsky,
Advances in Applied Mechanics, Academic Press, Inc., New York,
N. Y., vol. 1, 1948, pp. 41-103.

18 “Non-Autonomous Systems,” by A. I. Bellin, Advances in
Applied Mechanics, Academic Press, Inc., New York, N. Y., vol. 3,
1953, pp. 295-320.

19 “Ordinary Non-Linear Differential Equations in Engineering
and Physical Sciences,” by N. W. McLachlan, Clarendon Press,
Oxford, England, first edition, 1950, 201 pp.; enlarged second
edition, 1955.

20 “Nonlinear Vibrations in Mechanical and Electrical Systems,”
by J. J. Stoker, Interscience Publishers, Inc., New York, N. Y., 1950,
273 pp.

21 “Forced Oscillations in Non-Linear Systems,” by C. Hayashi,
Nippon Printing and Publishing Company, Osaka, Japan, 1953,
164 pp.

22 “Les méthodes nouvelles de la mécanique céleste” (The New
Methods of Celestial Mechanics), by H. Poincaré, Paris, France, 1892.

23 “Problème général de la stabilité du mouvement” (The General
Problem of the Stability of Motion), by A. Liapounoff, Annals of
Mathematics Studies, no. 17, Princeton University Press, Princeton,
N. J., 1947.

24 “Erzwungene Schwingungen bei veränderlicher Eigenfrequenz”
(Forced Oscillations With Arbitrary Eigenfrequency), by C. Duffing,
Braunschweig, Germany, 1918.

25 “Les systèmes autoentretenus et les oscillations de relaxation”
(Self-Oscillating Systems and Relaxation Oscillations), by P. Le
Corbeiller, Librairie Scientifique Hermann et Cie, Paris, France,
1931, 46 pp.

26 “Ordinary Differential Equations in the Real Domain With
Emphasis on Geometric Methods,” by W. Hurewicz, Brown
University, Providence, R. I., 1943, 129 pp.

27 “Nonlinear Mechanics,” by K. Friedrichs, J. Stoker, P. Le
Corbeiller, and N. Levinson, Brown University Press, Providence,
R. I., 1942-1943.

28 “Studies in Nonlinear Vibration Theory,” by H. Eckweiler,
D. A. Flanders, J. Stoker, K. Friedrichs, and F. John, Inst. Math.
Mech., New York University, New York, N. Y., 1946.

29 “Applied Mathematics for Engineers and Physicists,” by
L. Pipes, McGraw-Hill Book Company, Inc., New York, N. Y.,
1946; see chapter 22, “The Analysis of Nonlinear Oscillatory
Systems.”

30 “Number Series Method of Solving Linear and Nonlinear
Differential Equations,” by A. Madwed, Report No. 6445-T-26,
Instrumentation Laboratory, Massachusetts Institute of Technology,
Boston, Mass., 1950.

31 “Discontinuous Automatic Control,” by I. Flügge-Lotz,
Princeton University Press, Princeton, N. J., 1953, 168 pp.

32 “Stability Theory of Differential Equations,” by R. Bellman,
McGraw-Hill Book Company, Inc., New York, N. Y., 1953, 166 pp.

33 “Automatic Feedback Control System Synthesis,” by J. G.
Truxal, McGraw-Hill Book Company, Inc., New York, N. Y., 1955,
675 pp.

34 “Basic Books for Your Control Engineering Library, I.
Servomechanisms,” by T. J. Higgins, Control Engineering, vol. 1,
December, 1954, pp. 48-51. A complementary article is to appear
in a 1956 issue of Control Engineering.


Discussion 


Richard Bellman.³ It may be worth while, as an appendix to
this very valuable summary of the classical and modern theory 
of nonlinear differential equations and its interrelation with elec- 
tronic and mechanical circuit theory, to discuss briefly some recent 
work which has been done by the writer and collaborators in con- 
nection with control processes. 

We may consider every scientific theory to have several stages 
of development. In the first stage we attempt to describe and 
predict physical phenomena arising from given systems; in the 
second stage we attempt to design systems so as to produce de- 
sired phenomena. Although, at first glance, it might seem that a 
thorough mastery of the first stage is essential to progress in the 
second stage, this is not necessarily true. In other words, some 
classes of optimization problems may be easier to tackle than re- 
lated classes of purely descriptive problems. The reason for this 
lies in the fact that variational properties possessed by optimal 
systems in many cases greatly simplify the analytic structure of 
the equations which describe the system. 

A class of mathematical problems which arise in economic as 
well as engineering context is the following: Consider a nonlinear 
vector system of differential equations 


$$\frac{dx}{dt} = g(x, f(t)), \qquad x(0) = c,$$


where x(t) represents the state of a system at time t and f(t) is a
control vector which is to be chosen so as to have the system
perform in some desired way. An example of some interest in recent
years is that of “on-off” or “bang-bang” control, where the
components of f(t) assume only values ±1 and are to be chosen so as
to minimize the time required for the system to return to the
equilibrium state x = 0.
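
A minimal sketch of such a minimum-time “bang-bang” law may help fix ideas. The Python fragment below is an editorial illustration, not the method of the discussion: it assumes the simplest second-order plant, the double integrator x'' = u with |u| = 1, for which the well-known switching curve is x + v|v|/2 = 0, and it integrates the closed loop with a plain Euler step. The plant, the step size, the stopping tolerance, and the function names are all assumptions made for the example.

```python
def switching_control(x, v):
    """Bang-bang law for the double integrator x'' = u, with u = +1 or -1.

    The sign of s = x + v|v|/2 tells on which side of the switching
    curve the state lies.
    """
    s = x + 0.5 * v * abs(v)
    if s > 0.0:
        return -1.0
    if s < 0.0:
        return 1.0
    # Exactly on the switching curve: decelerate toward the origin.
    return -1.0 if v > 0.0 else 1.0


def time_to_origin(x0, v0=0.0, dt=1e-3, tol=0.02, t_max=20.0):
    """Simulated time at which both |x| and |v| first fall below `tol`."""
    x, v, t = x0, v0, 0.0
    while t < t_max:
        if max(abs(x), abs(v)) < tol:
            return t
        u = switching_control(x, v)
        x += v * dt  # explicit Euler step for position
        v += u * dt  # explicit Euler step for velocity
        t += dt
    return None  # origin not reached within t_max


if __name__ == "__main__":
    for x0 in (1.0, 4.0):
        print(x0, time_to_origin(x0), 2.0 * x0 ** 0.5)
```

For a start from rest at displacement x0 the minimum time for this plant is 2 sqrt(x0), with a single switch at the halfway point; the simulated times should come out close to that value, slightly short of it because the run stops at a finite tolerance.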
If the system is linear inhomogeneous,

$$\frac{dx}{dt} = Ax + f(t), \qquad x(0) = c,$$

the problem may be treated in various ways and explicit solutions
obtained (cf. Bellman, Glicksberg, Gross,⁴ where other references
may be found).

³ Rand Corporation, Santa Monica, Calif.

If the system is nonlinear, variational problems arise which
escape classical techniques in general. In a number of cases,
however, they may be resolved computationally, with the aid of
modern high-speed computers, using the techniques of the theory
of dynamic programming, Bellman,⁵,⁶ and the new approach to
the calculus of variations furnished by this theory.

A discussion of some of the applications of this method to the
general problem discussed in the foregoing is contained in
Bellman.⁷


K. G. Black.⁸ This is an interesting and informative paper.
It should be of considerable value to all engineers who encounter 
control problems in which nonlinearities are present. Many con- 
trol engineers are familiar with certain nonlinear techniques which 
they have found useful in their work, and the excellent list of non- 
linear methods presented in the paper should facilitate the selec- 
tion of others. The author’s considerable experience in teaching 
courses in both nonlinear and control theory has enabled him to 
suggest a program of study which offers both an aid and a challenge 
to the control engineer. 


T. H. Chin.⁹ To the analytical study and application of the
theory of nonlinear control systems, the author makes a most wel- 
come and informative contribution by presenting a concise inte- 
grated summary of the progress and of the development of non- 
linear control theory to date. 

The author emphasizes the possibility of employment of non- 
linear theory and techniques to enable design of control systems 
which yield maximal performance. He provides a most valuable 
outline of how to go about obtaining the knowledge of this 
theory. 

In recent years, as is well known, the determination of the per- 
formance of a control system has been based largely on fre- 
quency-response analysis and transient-response analysis. These 
techniques are most useful for analyzing highly linear systems. 
However, when a very high degree of accuracy is desired, then,
because most physical systems are not wholly linear, appropriate
attention must be given to the system nonlinearities resulting
from backlash, starting friction, hydraulic leakage, thermal ex- 
pansion, dielectric breakdown, amplifier distortion, resistor ther- 
mal dependency, and many other effects. 

It has been shown both analytically and experimentally that
the utilization of such simple nonlinear theory as phase-plane
analysis can embrace certain nonlinear effects in second-order
feedback-control systems and lead to substantial improvement
in performance. Evidently a very broad knowledge of nonlinear
theory, such as outlined by the author, would enable correspondingly
successful study of much more complex systems. Another
powerful tool in dealing with nonlinear problems is the use of
electrical analogies in which nonlinear effects due to mechanical,
thermal, hydraulic, and pneumatic components can be solved
conveniently by studying analogous electrical systems through the
aid of analog computing devices. It may be noted that detailed
information on one form of electrical-analog techniques is given in
the author’s paper.¹⁰

Certainly, in dealing with nonlinear problems, adequate
possession of the knowledge outlined by the author will often enable
a satisfactory solution of nonlinear control problems, which
otherwise would be unsolvable.

⁴ “On the Bang-Bang Control Problem,” by R. Bellman, I.
Glicksberg, and O. Gross, Quarterly of Applied Mathematics, vol. 14,
1956, pp. 11-18.

⁵ “The Theory of Dynamic Programming,” by Richard Bellman,
Bulletin of the American Mathematical Society, vol. 60, 1954,
pp. 503-516.

⁶ “Principles of Dynamic Programming,” by Richard Bellman,
Princeton University Press, Princeton, N. J., in press.

⁷ “On the Application of Dynamic Programming to the Study of
Circuit Analysis,” by Richard Bellman, Symposium on Nonlinear
Processes, Brooklyn Polytechnic Institute, Brooklyn, N. Y., April,
1956.

⁸ General Electric Company, Schenectady, N. Y.

⁹ Assistant Professor, Department of Electrical Engineering,
University of Pittsburgh, Pittsburgh, Pa.


R. N. Clark.¹¹ There is little question that nonlinear analysis
will continue to be of increasing importance in the control-system 
industry. There are many engineers now in industry who will 
have to become schooled in nonlinear mechanics without the 
benefit of formal education. The definition and identification of 
the various aspects of this subject that are presented in the paper 
should prove invaluable in orienting those engineers. The author 
is to be congratulated on a splendid presentation of a comprehen- 
sive job of library research and subject classification. 


G. H. Fett.¹² The author has carefully classified the literature
of nonlinear control-systems theory. Of special interest is his
catalog of methods of analysis.

To his list of 24 methods of analysis that he gives in Section VI
several combinations of interest to servomechanism theory can be
added. Topological analysis represents a fruitful area of study
when supplemented by the use of computers, a combination of
Nos. 19 and 23. For example, the operation of the control system,
dynamically, can be envisioned as the trajectory in a phase space
which gives the error displacement, velocity, and acceleration for
a third-order system, and this trajectory can be analyzed. Now,
if it is desired to bring the system back to equilibrium, then the
distance from the trajectory to the origin should be reduced to
zero in the shortest period of time¹³ with due consideration to the
abilities of the prime mover. With the aid of a linear transformation
and a computer¹⁴ a method for obtaining this characteristic
can be devised.

Another combination method of attack is represented by the
use of power-series expansion of the nonlinear elements in terms of
one or more of the variables, a modification of No. 8, coupled with
the Fourier expansions of No. 10. This method¹⁵ is particularly
useful in establishing average and instant instability criteria in
multiple-loop systems.

Equivalent linearization, No. 3, can be used to great advantage
to predict transient performance of systems with specific types of
nonlinearities such as backlash, saturation, and hysteresis.¹⁶

A method of synthesis can be extracted from the methods listed
under Nos. 23 and 24. A nonlinear control system is set up on a
computer with some particular type of nonlinearity. The output
is then observed as the input is varied, as the nonlinearity is
varied, and as other parameters are changed. Then the solution
to the nonlinear system is known, and if the resultant solution is
similar to the desired one, a basis for synthesis is developed. A
catalog of solutions thus is used as a reference for design.

As an added item to the bibliography which Professor Higgins
has provided, the writer would suggest the addition of Volumes II
and VI of the MRI Symposia Series published from the papers
given at New York meetings of the Professional Group on Circuit
Theory of the IRE, in 1953 and 1956.

¹⁰ “Electroanalogic Methods,” by T. J. Higgins, Applied Mechanics
Reviews, vol. 9, 1956, January, pp. 1-4; February, pp. 49-55.

¹¹ Research Engineer, Aeronautical Division, Minneapolis-Honeywell
Regulator Company, Minneapolis, Minn.

¹² Department of Electrical Engineering, University of Illinois,
Urbana, Ill.

¹³ “Metrization of Phase Space and Nonlinear Servo Systems,” by
C. L. Kang and G. H. Fett, Journal of Applied Physics, vol. 24, 1953,
p. 38.

¹⁴ “Higher Order Servos With Nonlinear Computer Using Phase
Space Techniques,” by E. J. Hagin, PhD thesis, University of
Illinois, 1956, to be published.

¹⁵ “A Phasor Method of Nonlinear Analysis,” by J. P. Neal, PhD
thesis, University of Illinois, 1955, accepted for publication by the
American Institute of Electrical Engineers, paper 56-804.

¹⁶ “Feedback Control Systems,” by G. H. Fett, Prentice Hall, Inc.,
New York, N. Y., 1954, pp. 330-345.


G. F. Forbes.¹⁷ With reference to item 23, section VI, of
the author’s paper, commercial digital differential analyzers
(DDA) are now available in the price class of about ten thousand
dollars. Such machines are mathematically equivalent to the
mechanical differential analyzer. They have much greater capacity
and all of the convenience with respect to the nonlinear
terms in the differential equation.

Availability of such machines, on a production basis, indicates 
a potential radical change in the philosophy and approach to non- 
linear system design. In many cases, the cost of the DDA is less 
than the cost of building a trial model of the nonlinear system. 
Any nonlinear device for which there is any possibility of an 
analytic solution can be simulated easily on the DDA. Nonlinear 
systems, for which no analytic approach is available, usually offer 
no difficulty to exact simulation. 
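
The incremental principle on which such a machine works can be illustrated with a short sketch. The Python fragment below is only an illustration and is not a model of any commercial DDA: it chains two rectangular integrators, each advancing by one small increment per step, to solve the pendulum equation θ'' = −sin θ. The choice of equation, the step size, and all names are assumptions made for this example.

```python
import math

def dda_pendulum(theta0, omega0, dt, steps):
    """Integrate theta'' = -sin(theta) with two chained incremental integrators."""
    theta, omega = theta0, omega0
    history = [(0.0, theta, omega)]
    for k in range(1, steps + 1):
        # Integrator 1: increment of omega from the nonlinear term
        omega += -math.sin(theta) * dt
        # Integrator 2: increment of theta from the freshly updated omega
        theta += omega * dt
        history.append((k * dt, theta, omega))
    return history

def energy(theta, omega):
    """Total energy per unit mass and length; constant for the exact solution."""
    return 0.5 * omega ** 2 + (1.0 - math.cos(theta))

# Hypothetical run: large initial angle, so the small-angle linearization fails
hist = dda_pendulum(theta0=2.0, omega0=0.0, dt=0.001, steps=20000)
e0 = energy(hist[0][1], hist[0][2])
e_end = energy(hist[-1][1], hist[-1][2])
print(f"relative energy drift: {abs(e_end - e0) / e0:.2e}")
```

Replacing `math.sin(theta)` with another function changes the nonlinearity without altering the integrator structure, which is presumably the kind of convenience referred to above.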


J. D. Graham.¹⁸ In this paper the author makes a definite
contribution to the young worker in the field of nonlinear control 
systems. By reading this résumé the person not versed in this 
subject can survey the field with a minimum reading and have an 
over-all picture of the methods and applications of nonlinear 
analysis. The paper also suggests that work remains to be done in
applying to control systems the methods of analysis that already
have been developed for nonlinear mechanics and electrical circuits.

In reading the material presented it should be remembered that
the present state of the nonlinear-analysis art is not wholly
satisfactory, for all the methods presented have serious limitations
in such respects as accuracy, length of calculations, and area of application.
Thus workers in nonlinear control systems are still challenged to 
effect new and more satisfactory methods in addition to using 
methods known in other fields. 


J. D. Horgan.¹⁹ To the engineer who has embarked on a program
of self-study in nonlinear control-system theory, or who
plans to begin such an effort soon, the prospect of finding and
evaluating the countless references on the subject is overwhelming,
perhaps completely discouraging. Thus it is that this paper,
which comprises, in fact, a plan of study, should be of great value. 
By giving a brief sketch of the historical background, theory, 
problems, and methods of solution, the author has provided a 
framework to which future studies may be referred, enabling the 
engineer to make an integrated approach to the subject. By 
directing the reader’s attention to certain key papers and books, 
the author has indicated how the limited time available for such 
studies can be used to best advantage. 

The emphasis in this presentation on historical background


17 Industrial Mathematician, Pacoima, Calif.

18 Assistant Professor, Department of Electrical Engineering,
Kansas State College, Manhattan, Kans.

19 Assistant Professor in Electrical Engineering (on leave, Marquette
University, Milwaukee, Wis.); at present, Graduate Student,
University of Wisconsin, Madison, Wis.


should be noted by all engineers, but particularly by engineering 
teachers. It is feared that this is a facet of engineering which is 
too often neglected in the classroom. Yet, it can give life and 
meaning to a topic which otherwise might appear to be drab and 
unimportant. 


Y. H. Ku.²⁰ As nonlinear analysis has gone through a great
deal of development in recent years, the author’s résumé is
most welcome. To quote van der Pol from his discussion on “Some
New Concepts and Theorems Concerning Nonlinear Systems,”
by Cherry and Millar:²¹

“After 1920 technicians soon became interested in these non- 
linear differential equations so that soon the whole field was 
studied intensively in the U.S. A. and there the practical impor- 
tance of these nonlinear differential equations was soon recog- 
nized. Then the war came. I was in Holland all the time during 
the Occupation so that we did not hear what happened outside in 
the civilized world, but after the war the door was opened and I 
was happily surprised to note that especially in France, Great 
Britain, and also in the United States, a large amount of work had 
been done and was being done by pure mathematicians during 
this time in the field of nonlinear differential equations.”

Minorsky’s paper on “Nonlinear Control Systems”²¹ made the
following comments on the van der Pol equation: 

“The connection between this discovery and the earlier work of 
Poincaré was ascertained only nine years later. From that 
moment on the progress in this new field, nonlinear mechanics, 
became systematic and rapid. Between 1930 and 1939 these 
studies were conducted almost exclusively in the USSR but 
after the end of the last war they were taken up also in the United 
States and in England.” 

Reference (17) of the paper contained 44 references. Keller’s
1941 AIEE paper²² contained a bibliography of a hundred entries.

To these references, the writer would like to add
the contributions edited by Lefschetz²³ and the writer’s recent
paper²⁴ on the phase-space method.


R. E. Kuba.²⁵ The author has provided a valuable and comprehensive
paper. It is interesting to note that even though the
beginnings of nonlinear control theory date back nearly a cen- 
tury, one may still encompass the entire body of consequential 
nonlinear control literature by consulting only a few score major 
works. This should act as a stimulus to bring new workers into a 
field loaded with research and development possibilities. 

Compiling an accurate bibliography and evaluating its basic 
characteristics is all too often a thankless task; therefore, on be- 
half of the nonlinear control people, the writer wishes to thank 
the author for the great deal of personal time and effort he has 
devoted to this end. 


Rolf W. Lanzkron.²⁶ In this paper the author shows very
clearly that the techniques and methods of analysis used for solv- 


20 Moore School of Electrical Engineering, University of Pennsylvania,
Philadelphia, Pa.

21 “Automatic and Manual Control,” papers contributed to the
Conference at Cranfield, 1951, Butterworths Scientific Publications,
London, England, 1952.

22 “Analytical Methods of Solving Discrete Nonlinear Problems in
Electrical Engineering,” by E. G. Keller, Trans. AIEE, vol. 60, 1941,
pp. 1194-1200.

23 “Contributions to the Theory of Nonlinear Oscillations,” edited
by S. Lefschetz, Princeton University Press, Princeton, N. J., vols. 1
and 2, 1950 and 1952.

24 “Analysis of Nonlinear Systems With More Than One Degree of
Freedom,” by Y. H. Ku, Journal of The Franklin Institute, vol. 259,
1955, pp. 115-131.

25 Associate Professor, Department of Electrical Engineering,
Wayne University, Detroit, Mich.

26 Remington-Rand Univac, St. Paul, Minn.


ing problems in nonlinear mechanics, and especially, nonlinear 
electric-circuit theory, provide powerful means of solving prob- 
lems in the field of nonlinear control systems, and that there are 
many such means which have not yet been used in control work. 

The author is to be commended for his excellent bibliography. 
It will be an aid to both the young engineer who is working in the 
field of nonlinear problems and wishes to get acquainted in general 
with some of these methods of solution, and the engineer long in 
the field who is trying to solve a specific problem. This is because 
the author has given a bibliography in which not only the names 
of specific books and the locations of certain articles are given, but 
also a useful discussion of the problems therein, the methods used 
in their solution, and the special values of each book or article.


N. Minorsky.²⁷ The author is to be congratulated for his
presentation of the whole field of nonlinear problems and methods 
in such a short and, at the same time, clear manner. No less im- 
portant is his statement regarding the advisability of basic 
studies in the theory of nonlinear differential equations for those 
who desire to work on nonlinear-control problems. 

Very often one observes, particularly on the part of beginners, 
a tendency to consider nonlinear problems as something which 
merely requires a correction in the solutions of the corresponding 
linear problems. This tendency is due undoubtedly to the existing
linear theory of control systems, which lends itself to arguments
of this kind inasmuch as a continuous variation of coefficients
of a differential equation generally accounts for a continuous
variation of its solutions. This circumstance found a
widespread application in all kinds of diagrams of transfer loci
obtained owing to the use of Laplace’s transform. In the nonlinear
field all this becomes very doubtful and frequently even
entirely wrong. The problems are full of pitfalls if looked at from 
the viewpoint of the familiar linear theory. A certain phe- 
nomenon may either appear or disappear abruptly if a certain 
critical value of a parameter in the differential equation is 
reached. If one tries to analyze the effect (i.e., the solution of the 
differential equation), it is impossible to understand what hap- 
pens and still less to master the phenomenon whose cause is con- 
cealed in the properties of the differential equation. The linear 
tools like Laplace’s transform, so useful in the theory based on 
linear differential equations, cease to be applicable in the nonlinear 
field. Likewise, the Nyquist diagram of stability does not apply 
any more, inasmuch as one has to use in the nonlinear field criteria 
associated with the variational equations of Poincaré, and so on. 

All this is due to the fact that the existing theory of control is
based (sometimes in a rather implicit form) on the theory of differential
equations in the complex domain, while the whole
nonlinear field belongs to differential equations in the real domain.
Now, as is well known, beginning with singularities, everything is
different in these two domains and there are no means whatever to 
establish any connection between them. 

The only way out from this situation is to start from the funda- 
mentals, as the author suggests, and build the theory of nonlinear 
control systems on the nonlinear basis. This may appear some- 
what disappointing for those who have acquired a habit of using 
the existing control theory, but such situations are inevitable in 
any theory when it has to be overhauled under the impact of new 
experimental facts. 

In this particular case the situation is not so difficult as it may 
appear from an offhand consideration. In fact, there are many re- 
sults from the theory of oscillations which can be transferred into 
the theory of nonlinear control systems. Thus, for instance, an 
engineer dealing with the theory of oscillations may be interested 
in establishing conditions for the existence of a limit cycle (i.e., 
stationary oscillation). A control engineer, on the other hand, 


27 Professor Emeritus, Stanford University, Stanford, Calif.



would be interested, on the contrary, in establishing conditions 
for the nonexistence of such cycles. In fact, in the first case a 
stationary oscillation is the desired goal while in the second case it 
is objectionable. Here the problem is the same but the purposes 
are different. The basic studies of this kind are obviously useful 
for further generalizations without which progress is impossible. 
Thus, for instance, instead of limiting oneself to the problem of 
establishing what happens in the presence of a given nonlinearity, 
one can foresee a much broader problem, namely, what kind of 
nonlinearity is to be introduced into the control system in order 
to predetermine its performance in a manner given in advance. 

Such problems of synthesis are yet in a very early stage of their
development. 
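
The contrast between the existence and the nonexistence of a limit cycle can be sketched numerically. The fragment below, offered only as an illustration, integrates the van der Pol equation x'' − μ(1 − x²)x' + x = 0 by a fourth-order Runge-Kutta scheme and reports the largest |x| observed after the transient has died out. The parameter values, step size, and function name are assumptions chosen for this example.

```python
def van_der_pol_amplitude(mu, x0, t_end=100.0, dt=0.01):
    """Largest |x| over the last 20 percent of the run, starting from (x0, 0)."""
    def f(x, v):
        # State equations: x' = v, v' = mu*(1 - x^2)*v - x
        return v, mu * (1.0 - x * x) * v - x

    x, v = x0, 0.0
    steps = round(t_end / dt)
    tail_start = int(0.8 * steps)
    amplitude = 0.0
    for k in range(steps):
        # Classical fourth-order Runge-Kutta step
        k1x, k1v = f(x, v)
        k2x, k2v = f(x + 0.5 * dt * k1x, v + 0.5 * dt * k1v)
        k3x, k3v = f(x + 0.5 * dt * k2x, v + 0.5 * dt * k2v)
        k4x, k4v = f(x + dt * k3x, v + dt * k3v)
        x += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        if k >= tail_start:
            amplitude = max(amplitude, abs(x))
    return amplitude

# Positive mu: trajectories from inside and outside approach the same cycle
amp_small = van_der_pol_amplitude(mu=0.5, x0=0.1)
amp_large = van_der_pol_amplitude(mu=0.5, x0=4.0)
# Negative mu: a small disturbance dies out and no oscillation is sustained
amp_damped = van_der_pol_amplitude(mu=-0.5, x0=0.1)
print(amp_small, amp_large, amp_damped)
```

In this sketch the sign of μ decides whether a small disturbance grows into a sustained oscillation or decays, so the oscillation engineer and the control engineer would choose opposite signs for the same equation.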


D. H. Moore.²⁸ It is as if the author had written this paper for
the writer’s personal benefit, since it fits his present need so well. 
The writer’s concern is to find a problem in nonlinear theory for 
a PhD thesis. There is a great deal of literature research to ac- 
complish and the writer will be greatly assisted by the author’s 
list of principal papers with comments on their particular values. 
It is heartening to see this important work so neatly presented 
and to find that the frontier is so nearly within reach. 


G. J. Murphy.²⁹ This excellent and timely résumé should be
of great value to the scientist who has been actively engaged in 
the analysis and synthesis of nonlinear control systems as well as 
to one who is about to enter this field. The thoroughness of the 
search which resulted in the publication of this paper is com- 
mendable, and the author’s comments and suggestions concerning 
the relative values of the listed items can well result in the saving 
of much time for the reader. 

However, although the procedure suggested in Section VIII of 
the paper for familiarization with published techniques is to be 
recommended to the research scientist, it is likely to be too time- 
consuming for the average engineer interested in acquiring a 
reasonable familiarity with practical methods of treating fre- 
quently encountered nonlinearities. For the latter group a more 
reasonable (and yet very profitable) approach is to study thoroughly
Chapters 10 and 11 of Truxal’s book³⁰ and then McDonald’s
work.³¹,³² Completion of such a program will provide
the desired familiarity with practical approaches in a minimum 
of time and can always serve as a good basis from which to proceed 
to further study as desired. 


L. M. Vallese.³³ The author gives a clear and informative
picture of the history and status of nonlinear theory. His com- 
petent critical appraisal of the various methods of analysis and of 
the results so far obtained should encourage those who are not yet 
familiar with nonlinear theory to undertake its study with con- 
fidence. 

There are various reasons for the limited diffusion of such studies
among engineers: the lack of a unified theory, the imperfection of
the results, and the difficulties of application to synthesis problems
are perhaps the most significant ones.

The theory has been developed by people with vastly different 


28 Santa Monica, Calif.

29 Assistant Professor of Electrical Engineering, University of
Minnesota, Minneapolis, Minn.

30 “Automatic Feedback Control-System Synthesis,” by J. G.
Truxal, McGraw-Hill Book Company, Inc., New York, N. Y., 1955.

31 “Nonlinear Techniques for Improving Servo Performance,” by
D. McDonald, Proceedings of the National Electronics Conference,
vol. 6, 1950, pp. 400-421.

32 “Multiple Mode Operation of Servomechanisms,” by D. McDonald,
Review of Scientific Instruments, vol. 23, January, 1952, pp.
22-30.

33 Associate Professor of Electrical Engineering, Polytechnic Institute
of Brooklyn, Brooklyn, N. Y.


interests, such as astronomers, pure and applied mathematicians, 
mechanical, electronic, control, acoustical engineers, and so on.
As a result the reader must digest papers with unfamiliar ter- 
minology, and then find out whether they may be applied to his 
own problems. In addition, while the perfection of linear theory
has permitted its rationalization into “symbolic” or “operational”
methods, which do not require thinking, the approximate nature
of the procedures and results of nonlinear analysis makes it
imperative to check continuously the physical significance and the
mathematical correctness of the various steps. Finally, the techniques
of application to synthesis problems are quite different for
linear and nonlinear cases. This is due to the fact that only in a
few instances may nonlinear elements be synthesized, while more
often they constitute fixed components of the design. As a result,
one rarely finds readily available analyses, and more often a direct
study is necessary in order to derive the conditions of optimum
design.


Ernst Weber.³⁴ The author is to be congratulated on the
thoroughness with which he has explored the history of nonlinear 
control-system theory and the comprehensive enumeration of the 
methods of analysis. 

This contribution will be very helpful to all who are interested 
in the existing literature on nonlinear analysis. In fact, the writer 
has direct use for it in connection with his own work in nonlinear- 
circuit theory. 


AUTHOR’S CLOSURE


The author is most appreciative of the favorable comment by 
Professors Chin, Graham, Horgan, Ku, Kuba, Murphy, Vallese, 
and Weber and by Messrs. Black, Clark, Lanzkron, and Moore on 
the general values and usefulnesses of his paper. Mr. Bellman’s 
note of recent developments which render dynamic programming 
a powerful method of nonlinear system analysis, Professor 
Fett’s remark of combinations and particular applications of some 
of the methods listed in Section VI, and Mr. Forbes’ statement of 
the recent commercial availability of digital differential analyzers 
complement, very pertinently, certain items in the paper. 

Professor Minorsky’s interesting discussion reaffirms and substantiates
the author’s own thesis, namely, that an accurate and
complete insight into the behavior of nonlinear systems can be
gained only by a thorough study and analysis of the actual
nonlinear systems and associated solution of the corresponding
nonlinear differential equations of performance, and not by
approximating them by linear systems, although this last procedure may
provide useful information pertinent to certain limited condi- 
tions of operation. It is precisely on the basis of this thesis, 
stressed both by him and by Professor Minorsky, that the author 
must differ with the remark in the last paragraph of Professor 
Murphy’s discussion. Thus Chapters 10 and 11 of Truxal’s
book³⁰ give, respectively, a good account of the so-called
describing-function technique and an introduction to some topological
aspects of nonlinear analysis; and McDonald’s two papers³¹,³²
afford an introduction (they have been followed by some 15 papers 
extending his initial work) to multiple-mode optimization of 
relay-type servomechanisms. But the whole of the knowledge 
encompassed in these just-mentioned sources is only a minute 
fraction of the existing total of nonlinear-system analysis which 
is available for use in control-system design and synthesis; and 
it is to the engineer who desires to make serious use of this avail- 
able total, rather than merely learn fragments of it, that this 
paper is addressed. 

Finally, the author extends his sincere thanks to all of the dis- 


34 Head, Department of Electrical Engineering, Polytechnic Institute
of Brooklyn, Brooklyn, N. Y.



cussers for the complimentary remarks and helpful comments
encompassed in their discussion as a whole. 

In conclusion, the author would direct attention to the recent 
excellent survey paper by Clauser³⁵ (to be considered as an additional
item relative to Section III on Orientation); would


35 “The Behavior of Nonlinear Systems,” by F. H. Clauser, Journal
of the Aeronautical Sciences, vol. 23, 1956, pp. 411-434.


stress that the Bibliography comprises books, survey articles, 
and special treatments of broad phases of theory as discussed in 
the context, and is not intended to include papers on specific 
problems or particular analyses; and would note that a separate, 
very exhaustive bibliography of some 400 such papers is availa- 
ble (from the Engineering Experiment Station, University of 
Wisconsin, Madison, Wis.) on request, as mentioned in the last 
paragraph of the paper. 




Electrohydraulic Servomechanism With an 
Ultrahigh-Frequency Response 


By D. P. ECKMAN,² C. K. TAFT,³ AND R. H.


This paper presents an analysis of a positional servo- 
mechanism of very high performance. This control 
employs an electronic error transducer which actuates a 
pilot valve by means of an amplifier and a torque motor 
to position a hydraulic cylinder. The control was to
have a frequency response whose amplitude ratio was
nearly unity up to 200 cycles per sec (cps) at an amplitude of
0.001 in. with 100-lb dry friction and a load mass of 200 lb.
These specifications were exceeded by the control described 
herein. The optimum open-loop gain and the closed-loop 
frequency response are determined by linearizing the sys- 
tem equations and using Laplace transform methods. 
The system also was analyzed by solving the nonlinear 
equations on an electronic analog computer to determine 
optimum gain, transient response, and frequency re- 
sponse. A comparison of the results indicates that for 
input signals which cause the control to operate outside 
the region in which the linearizing assumptions apply, the 
linear analysis still gives results which agree with those of 
the nonlinear analysis within a factor of three. 


NOMENCLATURE 
The following nomenclature is used in the paper: 


equilibrium value of valve spool nozzle opening, in. 
area of large side of piston, sq in. 

area of large end of valve spool, sq in. 

equilibrium opening of nozzle 2 in valve, in. 
isothermal bulk modulus of oil, psi 

displacement of piston from equilibrium position, in. 
transformed cylinder displacement, volts 
dimensionless cylinder velocity 

diameter of spool nozzle 1, in. 

diameter of valve nozzle 2, in. 

valve-spool diameter at ports, in. 

error signal from pickup, volts 

error = c − r, in.

= transformed error signal, volts 

= transformed error = C — R, volts 

= gross cylinder force, lb

f. = cylinder compressibility frequency, cycles per sec (cps) 
J, = friction force on piston, lb 


A, 
A 


Be 


wm, 


1 Portions of this paper were taken from Mr. Taft’s thesis, “An
Analysis of an Electrohydraulic Positional Servomechanism,”
submitted in partial fulfillment of the Degree of Master of Instrumentation
Engineering at Case Institute of Technology, Cleveland,
Ohio. Information concerning the complete thesis may be obtained
from the Librarian, Case Institute of Technology.

2 Professor of Mechanical Engineering, Case Institute of Technology.

3 Research Engineer, Warner and Swasey Company. Assoc. Mem.

4 Engineering Physicist, Warner and Swasey Company.

Contributed by the Instruments and Regulators Division of The
American Society of Mechanical Engineers and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters, January
10, 1956. Paper No. 56-IRD-8.


SCHUMAN,⁴ CLEVELAND, OHIO


constant force on piston, lb 

transformed piston friction force, volts 

transformed piston constant force, volts 

dimensionless gross cylinder-output force 

flow coefficient, in?/see Ib'/* 

change of flow rate through nozzle 1 in spool with valve
opening, in²/sec

change of flow rate through nozzle 1 in spool with p,,
in⁵/lb sec

change of flow rate through nozzle 1 in spool due to
change in supply pressure, in⁵/lb sec

change of flow rate through nozzle 2 in valve due to
torque-motor displacement, in²/sec

change of flow rate through nozzle 2 in valve due to
change in p,, in⁵/lb sec

change of cylinder force with cylinder velocity at constant
valve opening, lb sec/in.

change of cylinder force with valve opening at constant
cylinder velocity, lb/in.

change in flow rate to cylinder with cylinder pressure,
in⁵/lb sec

change in flow rate to cylinder with valve opening,
in²/sec

torque-motor-amplifier gain, Δ ma/volt

torque motor-differential transformer gain, volts/in.

torque motor-feedback amplifier gain, volts/volt

pickup-tube gain, volts/in.

torque-motor gain, in./Δ ma

pilot-valve gain, in./in.

load mass, lb sec²/in.

valve-spool mass, lb sec²/in.

oil pressure on large side of piston, psig 

equilibrium value of p,, psig 

deviation of p, from p,, psig 

transformed cylinder pressure, volts 

supply pressure, psig 

deviation of p, from equilibrium value, psi 

oil pressure on large side of valve spool, psig 

deviation of p, from equilibrium value, psig 

oil-flow rate to cylinder, in³/sec

oil-flow rate through nozzle 1 in valve spool, in³/sec

equilibrium value of oil-flow rate through nozzle 1, in³/sec

oil flow through nozzle 2 in valve block, in³/sec

equilibrium value of oil flow through nozzle 2, in³/sec

oil-flow rate through valve-supply port, in³/sec

equilibrium value of oil-flow rate through valve-supply port, in³/sec

oil-flow rate through valve drain port, in³/sec

equilibrium value of oil-flow rate through valve drain port, in³/sec

reference input, in. 

transformed reference input, volts 

Laplace operator 

transformed Laplace operator 

equilibrium piston position measured from position when 
minimum clearance exists on the valve side of the 
cylinder, in. 


F, = 
F,= 
j= 
K= | 
= 
K; = | 
Ky 
Ks = | 
= 
K; 
K, = 
Ky, = 
K, = 
K, = 
K, = 
K, = 
M, = 
M,= 
= 
Pa = 
Pe’ | 
P, = 
P, = 
= 
= 
= 
a = 
Qn = 
Qro = 
Qn = 
= | 
Qu = 
Quo 
Qu = 
Qm = 
Tr = 
R = | 


cylinder compressibility time constant, sec 
cylinder velocity time constant, sec 
torque-motor time constant, sec 
pilot-valve time constant, sec 
pilot-valve underlap, in. 
volume of oil in large side of cylinder and in line from 
valve to cylinder, cu in. 
minimum value of V, cu in. 
valve displacement, in. 
= dimensionless valve displacement 
transformed valve motion, volts 
maximum valve motion, in. 
torque-motor displacement, in. 
maximum torque-motor motion, in. 
transformed torque-motor displacement, volts 
pickup-linkage ratio 
cylinder damping ratio 
torque-motor damping ratio 
valve damping ratio 


r 


generalized equilibrium cylinder position = s + re 


in. 
transformed equilibrium position, volts 
oil mass density, lb sec²/in⁴
transformed computing time, sec. 


INTRODUCTION 


The trend today is toward increased automation to speed up 
production processes. Many processes require high-power and 
high-performance positional controls in order to be automatized 
and this is especially true in the field of machine tools. Hydrau- 
lic controls employing hydraulic cylinders and valves can be 
designed to deliver large amounts of power with good dynamic 
performance. However, a mechanical input is often required, 
such as the displacement of a cam follower riding on a cam. 
Electronic components may be used along with the hydraulic 
cylinder and valve to permit more flexible and compact arrange- 
ment of error-sensing devices, remote location of error-sensing 
devices, dynamic compensation of the system transfer character- 
istics, easily adjustable loop gain, and electrical reference inputs. 
Thus a more versatile control can be designed using both elec- 
tronic and hydraulic components. 

The electrohydraulic positional servomechanism that is to be 
analyzed uses electronic components in the low-power input 
section and hydraulic components in the high-power output sec- 
tion. 

In this paper will be described the methods of linearization and 
linear analysis and the use of the electronic analog computer in 
solving the nonlinear equations for the dynamic performance 
of the control. The region of validity of the linear analysis is 
indicated in the comparison of the results obtained by the two 
methods of analysis. 

The design of high-performance controls has brought forth 
many problems. Resonances in the open-loop frequency re- 
sponse due to load mass and oil compressibility, along with 
resonances of the control-mounting structure, can limit the re- 
sponse of high-performance positional controls. Any resonant 
frequencies of the elements in the control loop which are near 
or below the desired cutoff frequency can cause this type of 
servomechanism to be unstable except at very low open-loop 
gains. When the open-loop gain is low, good closed-loop per- 
formance is impossible at high frequencies. The problems in- 
volved in control stabilization with mount structural resonances 
and oil compressibility resonances are the same from a control 
standpoint. 
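
The resonance due to load mass and oil compressibility can be estimated with the usual lumped oil-spring approximation, in which a trapped oil volume V of bulk modulus B acting on a piston area A behaves as a spring of stiffness BA²/V. The sketch below assumes this approximation, a single oil column, and a rigid mount. The piston area, oil volume, and bulk modulus are hypothetical values; only the 200-lb load is taken from the specification quoted in the abstract. It is not the authors' derivation.

```python
import math

G_IN_PER_SEC2 = 386.1  # standard gravity in in./sec^2, converts weight to mass

def oil_column_resonance_cps(bulk_modulus_psi, piston_area_sq_in,
                             oil_volume_cu_in, load_weight_lb):
    """Natural frequency (cps) of a load mass on a single trapped oil column."""
    mass = load_weight_lb / G_IN_PER_SEC2  # lb sec^2/in.
    # Oil-spring stiffness B*A^2/V, lb/in.
    stiffness = bulk_modulus_psi * piston_area_sq_in ** 2 / oil_volume_cu_in
    return math.sqrt(stiffness / mass) / (2.0 * math.pi)

# Hypothetical cylinder: B = 250,000 psi, A = 2 sq in., V = 4 cu in.
f_assumed = oil_column_resonance_cps(250_000.0, 2.0, 4.0, 200.0)
# Doubling the trapped volume softens the oil spring and lowers the resonance
f_double_volume = oil_column_resonance_cps(250_000.0, 2.0, 8.0, 200.0)
print(f"{f_assumed:.1f} cps, {f_double_volume:.1f} cps")
```

With these assumed numbers the resonance falls below the 200-cps requirement, which illustrates why such a resonance can limit the usable open-loop gain.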


TRANSACTIONS OF THE ASME 


LINEAR ANALYSIS


The transfer function of each element in the control will be 
derived from the differential equations. The open-loop fre- 
quency-response function will be obtained from the transfer 
functions and will be used to determine the optimum open-loop 
gain. 


Fig. 1 Block Diagram of Electrohydraulic Servomechanism


The control is shown in block diagram form in Fig. 1. The 
error transducer is mounted on the piston of the hydraulic cylin- 
der and contacts a template. When an error exists between the 
template and piston positions, the transducer output voltage 
changes. This voltage change is the input to a torque-motor 
amplifier whose output is a difference in current to the coils of a 
torque motor. This differential current displaces the torque- 
motor armature. The motion of the armature is detected by a 
differential transformer whose output is a modulated voltage 
proportional to armature displacement. This voltage is de- 
modulated, amplified, and subtracted from the transducer out- 
put-voltage change to correct for armature-position errors. 
The torque-motor armature controls the opening of a nozzle 
which is in series with a nozzle in the valve spool whose opening 
depends on spool position. A motion of the torque-motor 
armature causes an unbalance of the pressure-area forces on the 
valve spool imparting motion to it in the direction which restores 
the pressure balance. The pilot spool controls the flow of oil 
to the cylinder imparting a velocity to the piston to correct 
the error which caused the transducer signal. 

Error Transducer. The error transducer is a triode electronic 
tube with a movable anode (Radio Corporation of America No. 
5734). Displacement of the anode causes a proportional varia- 
tion in anode voltage. The tube is mounted with a linkage 
so that the anode motion is restricted and is a fraction of the 
control error. 

The moving element of the tube has a natural frequency of 
12,000 cps. The linkage was designed so that its resonant frequency
was above 500 cps.

The equations for each element of the system are given in 
Appendix 1. 

Torque Motor and Amplifier. The torque motor is a device
which converts an electrical signal into a displacement. The armature
is a flat plate that has two rods attached to its center which
are anchored at their ends. The axis of these rods is the axis 
of rotation of the plate. Mounted around the armature on op- 
posite sides of the torsion rods are two coils. This assembly is 
located between two magnets, the poles of which are separated by 
a small gap from each end of the armature. When more current 
flows through one coil than the other (a differential current exists 
in the coils) the resulting net flux in the armature supports the 
flux flowing between the magnets on one end of the armature and 
opposes the flux on the other end. This causes a torque on the 
armature in the direction of the maximum flux which twists the 
torsion rods displacing the armature. 

The torque motor used here was a Midwestern Geophysical 
Laboratory Model 9. 

Referring to Fig. 2, the torque-motor armature operates against
two open nozzles which in turn control the oil flow from the
pilot-valve nozzle and therefore position the pilot-valve spool as
described in the next section.

Fig. 2 Schematic Diagram of Pilot Valve

Fig. 3 Schematic Diagram of Cylinder and Valve

Pilot Valve. The pilot valve is shown schematically in Fig. 2, 
and the operation is as follows: When the torque-motor
displacement y increases, the area of opening of the outlet nozzles
decreases. Thus the control pressure p_c increases, exerting a net
force on the valve spool causing it to move to the right. This
increases valve displacement x. Increasing valve displacement
decreases the opening of nozzle 1. Since supply pressure is
assumed constant, increasing the pilot-valve displacement causes
the control pressure to decrease to its equilibrium value. The
area of the supply-pressure side of the spool is equal to one half 
the area of the other side so that the equilibrium value of the 
control pressure is one half supply pressure. This type of ar- 
rangement permits an input displacement at low power level to 
position the valve spool with good accuracy at high frequencies. 

The spool is constructed so that there is negligible pressure 
drop across any bearing surface. Binding of the valve spool 
may be caused by variations in pressure around the circumference 
of the spool caused by nonconstant radial clearances between the 
spool and the valve block. The bearing lengths are made short 
by grinding the diameter of all but a short length of the spool 
lands slightly under the diameter of the bearing surface. The 
ends of the bearing surfaces are grooved and joined by drilled 
holes so that essentially no pressure drop exists across these bear- 
ing surfaces (1).5 This reduces spool friction to a very small 
value. 

The flow through the nozzles will be assumed to be the same 
as that through a sharp-edged orifice of the same area. It has 
been found that for a discharge coefficient of 0.61 this assumption 
is correct for many different nozzle configurations. This is not 
correct for very small flows or for nozzles whose area is greater 
than about one tenth the area upstream (2, 3). The nozzles 
are operated at openings of less than one quarter their diameter, 
so that the nozzle area is equal to the circumference times the 
nozzle opening. 
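
The orifice assumption above can be illustrated numerically. The following is a minimal sketch, not the authors' calculation: it treats the nozzle as a sharp-edged orifice whose area is the circumference times the opening, with the 0.61 discharge coefficient quoted in the text. The function name, the argument names, and the oil mass density in the example call are assumptions introduced here for illustration only.

```python
import math

DISCHARGE_COEFF = 0.61  # sharp-edged orifice value quoted in the text


def nozzle_flow(diameter, opening, pressure_drop, density):
    """Volume flow through a nozzle modeled as a sharp-edged orifice.

    Consistent units are assumed (for example in., psi, and
    lb-sec^2/in^4, which gives cu in/sec). The flow area is taken as
    the nozzle circumference times the nozzle opening.
    """
    if not 0.0 <= opening < diameter / 4.0:
        # At an opening of D/4 the circumference-times-opening area
        # equals the bore area, so the model is not used beyond it.
        raise ValueError("opening must be less than one quarter of the diameter")
    if pressure_drop < 0.0:
        raise ValueError("pressure drop must be non-negative")
    area = math.pi * diameter * opening
    return DISCHARGE_COEFF * area * math.sqrt(2.0 * pressure_drop / density)


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to exercise the function.
    rho = 7.8e-5  # assumed oil mass density, lb-sec^2/in^4
    print(nozzle_flow(diameter=0.040, opening=0.005,
                      pressure_drop=200.0, density=rho))
```

In this model the flow is linear in the opening and varies with the square root of the pressure drop, which is what makes the small-signal linearization of Appendix 1 possible.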


5 Numbers in parentheses refer to the Bibliography at the end of 
the paper. 




Leakage by the spool lands and compressibility of the en- 
trapped oil will be neglected since both have little effect here. 

Cylinder. The cylinder used is a single-acting type with con- 
stant back pressure, as indicated in Fig. 3. The area of the 
constant-pressure side is equal to one half that of the variable- 
pressure side. Flow to the cylinder is controlled by a zero-lapped 
valve with no radial clearance which ideally has no leakage 
when it is centered in the valve block. This type of pilot is
impossible to manufacture with no clearance; however, small
clearances have a small effect on performance, as will be shown.

The force-velocity curves for the pilot valve and cylinder 
combination are described in Appendix 2. 

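Open-Loop Linear Analysis. The complete system equations
in linear form as shown in Appendix 1 are as follows:

(a) Error transducer open loop

$$\frac{e'}{e} = \frac{K_t}{a} \qquad [1a]$$

(b) Torque motor and amplifier with loop closed around it

$$\frac{y}{e'} = \frac{K_A K_m}{T_m^2 s^2 + 2\zeta_m T_m s + 2} \qquad [3]$$

(c) Pilot valve

$$\frac{x}{y} = \frac{K_p}{T_p^2 s^2 + 2\zeta_p T_p s + 1}$$

(d) Cylinder

$$\frac{c}{x} = \frac{1}{T_c' s\,(T_c^2 s^2 + 2\zeta_c T_c s + 1)}$$

Combining these, the open-loop transfer function is

$$\frac{c}{e} = \frac{K_t K_A K_m K_p}{a\,T_c' s\,(T_c^2 s^2 + 2\zeta_c T_c s + 1)(T_p^2 s^2 + 2\zeta_p T_p s + 1)(T_m^2 s^2 + 2\zeta_m T_m s + 2)} \qquad [21]$$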


The value of $K_A$ must be determined for optimum closed-loop
response. As an approximate criterion $K_A$ should be adjusted
so that

$$\left|\frac{c}{e}\right| = 1$$


at 135-deg phase lag. As long as the value of 


c 
<0 
e 


at 180-deg phase lag this criterion may provide satisfactory re- 
sponse. 

The effect of the oil compressibility and load mass is to cause
a peak in the cylinder-amplitude ratio at a "resonant" frequency.
As the equilibrium piston position moves toward larger volume
(to the right in Fig. 3) the resonant peak occurs at lower
frequencies. At this resonant frequency the phase lag between
valve position x and piston position c is 180 deg. Thus as
cylinder stroke is increased, the frequency at which the control 
has 180-deg phase lag is decreased. As this frequency decreases 
the value of

$$\left|\frac{c}{eK_A}\right|$$

at 180-deg phase lag increases (see Figs. 4 and 5). Thus the
open-loop gain must be reduced as the oil compressibility
frequency decreases. In this system, lower gain decreases the
cutoff frequency of the closed-loop frequency response and de- 
creases the rate at which the piston corrects position errors. 
Using a Nichols chart it can be shown that for a given system, 
decreasing open-loop gain causes the cutoff frequency to be lower. 
Therefore, for high performance, it is desirable to have the oil com- 
pressibility frequency as high as possible. 

The resulting open-loop response with $K_A = 4.7$ Δma/volt is
plotted in Figs. 4 and 5.
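
The approximate gain criterion described above (unity open-loop amplitude ratio at 135-deg phase lag) can be sketched in a few lines. This is an illustrative calculation under stated assumptions, not the authors' procedure: the constants are those listed under Numerical Values in Appendix 1, the subscript names (`K_M`, `T_P`, and so on) are assumed because the scan lost most subscripts, and the bisection search is simply one convenient way to locate the 135-deg frequency. The gain it returns need not coincide with the 4.7 Δma/volt used in the paper, since the authors describe the criterion as approximate.

```python
import cmath
import math

# Constants taken from the Numerical Values of Appendix 1.
# The subscript names are assumptions made for this sketch.
KT_OVER_A = 1830.0            # transducer sensitivity over linkage ratio, volts/in.
K_M = 7.5e-4                  # torque-motor gain, in. per unit differential current
K_P = 1.555                   # pilot-valve gain
T_M, ZETA_M = 4.82e-4, 0.33   # torque motor (loop closed around it)
T_P, ZETA_P = 2.94e-4, 0.673  # pilot valve
T_C_PRIME = 7.0e-3            # cylinder integration time constant, sec
T_C, ZETA_C = 2.87e-4, 0.386  # cylinder compressibility resonance


def _factors(freq_cps):
    """Denominator factors of the open-loop function at s = j*2*pi*f."""
    s = 2j * math.pi * freq_cps
    return [
        T_C_PRIME * s,
        T_C**2 * s**2 + 2 * ZETA_C * T_C * s + 1,
        T_P**2 * s**2 + 2 * ZETA_P * T_P * s + 1,
        T_M**2 * s**2 + 2 * ZETA_M * T_M * s + 2,
    ]


def phase_lag_deg(freq_cps):
    # Every factor has a positive imaginary part for f > 0, so each phase
    # lies between 0 and 180 deg and the sum is the unwrapped lag.
    return sum(math.degrees(cmath.phase(z)) for z in _factors(freq_cps))


def magnitude_per_unit_gain(freq_cps):
    """Open-loop amplitude ratio |c/e| divided by the amplifier gain."""
    denom = 1.0
    for z in _factors(freq_cps):
        denom *= abs(z)
    return KT_OVER_A * K_M * K_P / denom


def gain_for_unity_at_lag(target_lag_deg=135.0, lo=1.0, hi=2000.0):
    """Bisect for the frequency at the target lag, then size the gain there."""
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if phase_lag_deg(mid) < target_lag_deg:
            lo = mid
        else:
            hi = mid
    freq = 0.5 * (lo + hi)
    return freq, 1.0 / magnitude_per_unit_gain(freq)


if __name__ == "__main__":
    freq, k_a = gain_for_unity_at_lag()
    print(f"135-deg lag at about {freq:.0f} cps; amplifier gain about {k_a:.2f}")
```

The same two helper functions can be evaluated at 180-deg lag to check how much margin a chosen gain leaves.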

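Closed-Loop Frequency Response. The closed-loop transfer
function is

$$\frac{c}{r} = \frac{(K_t K_A K_m K_p)/a}{(K_t K_A K_m K_p)/a + T_c' s\,(T_c^2 s^2 + 2\zeta_c T_c s + 1)(T_p^2 s^2 + 2\zeta_p T_p s + 1)(T_m^2 s^2 + 2\zeta_m T_m s + 2)} \qquad [23]$$

With actual data for the system (Appendix 1) the function
becomes

$$\frac{c}{r} = \frac{1}{1.15 \times 10^{-24} s^7 + 9.96 \times 10^{-21} s^6 + 6.3 \times 10^{-17} s^5 + 2.3 \times 10^{-13} s^4 + 6.59 \times 10^{-10} s^3 + 1.09 \times 10^{-6} s^2 + 1.4 \times 10^{-3} s + 1} \qquad [23a]$$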


The closed-loop frequency response is calculated and plotted 
in Figs. 6 and 7, where it is compared to the closed-loop response 
obtained by the computer. 
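
As a cross-check on the numerical closed-loop function, the denominator polynomial can be expanded directly from the element constants. The sketch below is an assumption-laden illustration rather than the authors' computation: it uses the Appendix 1 constants with an amplifier gain of 4.7, assumed subscript names, and a plain list-based polynomial product.

```python
# Constants from the Numerical Values of Appendix 1 (subscript names assumed).
KT_OVER_A = 1830.0
K_M = 7.5e-4
K_P = 1.555
T_M, ZETA_M = 4.82e-4, 0.33
T_P, ZETA_P = 2.94e-4, 0.673
T_C_PRIME = 7.0e-3
T_C, ZETA_C = 2.87e-4, 0.386


def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists in ascending powers of s."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def closed_loop_denominator(k_a):
    """Coefficients (ascending powers of s) of the denominator of c/r,
    normalized so that the constant term is 1."""
    loop_gain = KT_OVER_A * k_a * K_M * K_P
    denom = [0.0, T_C_PRIME]  # the cylinder integration term T_c' s
    for t, zeta, const in ((T_C, ZETA_C, 1.0), (T_P, ZETA_P, 1.0), (T_M, ZETA_M, 2.0)):
        denom = poly_mul(denom, [const, 2.0 * zeta * t, t * t])
    denom = [coeff / loop_gain for coeff in denom]
    denom[0] += 1.0  # unity feedback adds the loop gain to the denominator
    return denom


if __name__ == "__main__":
    for power, coeff in enumerate(closed_loop_denominator(4.7)):
        print(f"s^{power}: {coeff:.3e}")
```

With these inputs the expansion gives a seventh-order polynomial, matching the order of the function printed above.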


NONLINEAR ANALOG-COMPUTER ANALYSIS 


System Equations. The system equations given in Appendix 1 
were set up on a George A. Philbrick Researches, Inc., analog 
computer. 




Closed-loop response to a step input was recorded under vary- 
ing conditions. A large step input of 0.004 in. was chosen be- 
cause in many applications of this control it will have to respond 
with fidelity to large step inputs. The step input was large 
enough to represent a typical input but small enough so that the 
error-detecting device did not limit. 

A closed-loop frequency response was run at various ampli- 
tudes and was plotted along with the linear-frequency response 
in Figs. 6 and 7. The computer analysis indicated that a gain
of K_A = 2 Δma/volt gave satisfactory performance. The phase
lags at varying frequency correspond closely to the linear
response, but the computer response was less oscillatory at the gain
of K_A = 2 Δma/volt and the amplitude ratio fell to 0.82 before
rising to a peak of 1.25 at 320 cps for large amplitudes, and a
peak of 1.95 at 350 cps for small amplitudes.

This discrepancy between the results of the two methods of 
analysis is due to the large reference inputs used in the non- 
linear analysis. These large inputs caused the cylinder to operate 
in regions of the force-velocity curves where the dynamic char- 
acteristics were far different from those assumed in the linear 
analysis. 

However, the two sets of results do agree within limits. Since 
the gain in the control is easily varied, the linear analysis gave 
quite satisfactory results. However, if a more accurate deter- 
mination of frequency response or transient response is desired 
at large amplitudes where linearization techniques do not apply, 
the analog-computer solution of the nonlinear equations is neces- 
sary. 

The effect of a constant force f, on the response is shown in Fig. 
8. Increased constant force causes the rise time to increase. 


Figs. 4, 5 Linearized Open-Loop Frequency Response (amplitude ratio and phase lag, degrees, versus frequency, cps; K_A = 4.7 Δma/volt)

Figs. 6, 7 Computed Closed-Loop Frequency Response (amplitude ratio and phase lag, degrees, versus frequency, cps; linear response compared with nonlinear response)


This is due to the limiting of spool motion as well as the effect 
on dynamic performance of the flattening of the cylinder char- 
acteristics. Referring to Fig. 9, it is seen that under large con- 
stant forces the maximum piston velocity decreases and the 
force-velocity curves flatten out and converge. This accounts 
for this slow response. 


TEST RESULTS


The system described was built and tested at Case Institute 
of Technology. Because of various improvements made in the 
many control elements it was unfortunately not possible to test 
exactly the same system as was described in the mathematical 
and computer analysis. The actual response of the torque 
motor with the loop closed around it had indicated a lower 
natural frequency and higher damping ratio than was assumed 
for the linear analysis. The load mass was not as high but the
cylinder-equilibrium position was much larger so that
compressibility frequency was not radically altered in analysis and
test. A summary of these conditions is given in Table 1.

The test results were obtained on a system mounted on a large 
and rigid I-beam structure fastened to a heavy concrete floor. 


COMPARISON OF TEST CONDITIONS 


Linear Nonlinear com- 
analysis puter analysis 
Torque motor In eps 470 


Load mass, lb. . 200 
Dry friction, Ib.. 0 ond 
0.0004 | 0.00075 


TABLE 1 


Cy inder equil. position, 0.5 
Output amplitude, in 

Amplifier gain, 3 re 4.7 
Loop gain, sec ~ ‘i 600 


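Fig. 8 Closed-Loop Response From Nonlinear Analog-Computer Analysis
(K_A = 2 Δma/volt, equilibrium cylinder position 0.5 in.)
(a) constant force 2000 lb, friction force 250 lb
(b) constant force 1000 lb, friction force 250 lb
(c) constant force 0, friction force 250 lb
(d) constant force 2000 lb, friction force 0
(e) constant force 1000 lb, friction force 0
(f) constant force 0, friction force 0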

The hydraulic-power supply must provide reasonably smooth 
pressure but no accumulators were used. Reinforced rubber 
hose was used for power-supply lines. 

The closed-loop response of the system is shown in Figs. 10 
and 11. The response is quite stable although oscillatory. The 
frequency response indicates that the transient response (not 
shown here) would oscillate several cycles before equilibrium was 
reached. 


CONCLUSIONS 


The compressibility of the entrapped oil in the large side of the 
cylinder between the valve and the piston causes the cylinder 
to have an oscillatory response. If the resonant frequency of 
this oscillatory component is not well above the desired cutoff 
frequency of the control, it will introduce enough phase lag to 
lower the cutoff frequency. This becomes a problem when the 
desired cutoff frequency is above 150 cps using a hydraulic 
cylinder of less than 4-in. diam at strokes of more than 2 in. To 
increase the compressibility resonant frequency the cylinder 
diameter should be increased, its stroke decreased, and its load 
mass decreased. 
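
The direction of these three recommendations follows from the usual trapped-oil spring model, sketched below. This is a hedged illustration and not the expression derived in Appendix 1: it assumes a single trapped volume equal to piston area times equilibrium position, a rigid load, and no damping. The function name, its arguments, and the bulk-modulus figure in the example are assumptions introduced here.

```python
import math

G_IN_PER_SEC2 = 386.0  # acceleration due to gravity, in/sec^2


def compressibility_resonance_cps(bulk_modulus_psi, piston_area_sq_in,
                                  equilibrium_position_in, load_weight_lb):
    """Undamped resonance of a load mass on a column of trapped oil.

    The oil is treated as a spring of stiffness B * A^2 / V, where the
    trapped volume V is taken as area times equilibrium position.
    """
    mass = load_weight_lb / G_IN_PER_SEC2                  # lb-sec^2/in.
    volume = piston_area_sq_in * equilibrium_position_in   # cu in.
    stiffness = bulk_modulus_psi * piston_area_sq_in**2 / volume  # lb/in.
    return math.sqrt(stiffness / mass) / (2.0 * math.pi)


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    base = dict(bulk_modulus_psi=2.0e5, piston_area_sq_in=3.0,
                equilibrium_position_in=0.5, load_weight_lb=200.0)
    print(compressibility_resonance_cps(**base))
    print(compressibility_resonance_cps(**{**base, "equilibrium_position_in": 2.0}))
```

In this model the resonant frequency varies as the square root of area divided by the product of mass and stroke, so a larger diameter, a shorter stroke, and a lighter load each raise it.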

The methods of analysis used here indicate that, for a control 
of this type, which employs a zero-lapped pilot and hydraulic 
cylinder, linearized dynamic analysis provides better than order- 
of-magnitude results when compared to the results from the 
nonlinear analysis. If accurate values of the open-loop gain and 
transient response to large input signals are desired, the non- 
linear analysis employing an analog computer must be used. 

At small equilibrium-cylinder positions the system-transient
performance is good and it has a frequency response which is
nearly flat to 400 cps.

Fig. 9 Valve-Cylinder Force-Velocity Curves Corrected for Valve Clearances (force versus piston velocity)

Figs. 10, 11 System Closed-Loop Frequency-Response Test (amplitude ratio and phase lag, degrees, versus frequency, cps; K_A = 1.6 Δma/volt)

Figs. 12, 13 Torque Motor and Amplifier Open-Loop Response (amplitude ratio and phase lag, degrees, versus frequency, cps; ±0.010-in. and ±0.015-in. output motion; from data by J. W. Schwartzenberg)

Table 1, Actual test column as printed: 350, 20, 10, 1.25, 0.001

Dry friction of the type which is a constant force opposing 
piston velocity, seems to be desirable if the system is to be oper- 
ated at long cylinder strokes with large mass loads. 


ACKNOWLEDGMENTS 


The authors wish to acknowledge the support of this work at 
Case Institute of Technology by the Warner & Swasey Company. 

Significant contributions were made by a number of the 
graduate students in Instrumentation. Mr. George M. Lance 
investigated the pilot-valve systems. Mr. Jack W. Schwartzen- 
berg pioneered the amplifier and torque motor, which was further 
carried on by Mr. Steve Hemlar and Mr. Lee Gallaher. Mr. 
William F. Ortman selected the error transducer. 


BIBLIOGRAPHY 


1 "Pressure Bind Relief," by D. P. Eckman, Notes in Course
ME 273, Case Institute of Technology, Cleveland, Ohio, 1955.

2 "Contributions to Hydraulic Control, 3," by J. F. Blackburn,
Trans. ASME, vol. 75, 1953, pp. 1163-1170.

3 "Engineering Applications of Fluid Mechanics," by J. C.
Hunsaker and B. G. Rightmire, McGraw-Hill Book Company, Inc.,
New York, N. Y., 1947, pp. 157-158.

4 "Servomechanisms and Regulating System Design," by H.
Chestnut, John Wiley & Sons, New York, N. Y., 1951, vol. 1, p. 331.

5 "An Analysis of a Hydraulic Servomechanism," by G. M.
Lance, Master's thesis, Case Institute of Technology, 1954,
Appendix 1.

6 "Frequency of Oscillation of Cylinder Systems Due to the
Compressibility of Oil," by D. P. Eckman, Notes in Course ME 273,
Case Institute of Technology, Cleveland, Ohio, 1955.

7 Reference (4), pp. 311-314.

8 "Principles of Servomechanisms," by G. S. Brown and D. P.
Campbell, John Wiley & Sons, New York, N. Y., 1948, p. 107.

9 "Theory of Servomechanisms," by H. M. James, N. B.
Nichols, and R. S. Phillips, McGraw-Hill Book Company, New York,
N. Y., 1947, pp. 179-185.

10 "Electronic Analog Computers," by G. A. Korn and T. M.
Korn, McGraw-Hill Book Company, New York, N. Y., 1947, pp.
157-158.


Appendix 1


SYSTEM EQUATIONS

Error Transducer. The linkage ratio is 1/a. The tube sensitivity
is $K_t$ volts/in. The response of the transducer is assumed
to be

$$\frac{e'}{e} = \frac{K_t}{a}$$

where e = c - r when the system loop is closed.
Torque Motor and Amplifier. The torque motor is a Midwestern
Geophysical Laboratory Model 9. It is driven by an
amplifier with high current feedback so that the droop in fre- 
quency response caused by the inductance and resistance of the 
torque-motor coils is minimized. The experimental torque- 
motor response is shown in Figs. 12 and 13. It is nonlinear; 
however, its response can be approximated by a second-order 
linear differential equation as shown by the solid curve, assuming 
9-deg phase lag at zero frequency. 

The torque-motor and amplifier transfer function (linear part)
can then be written

$$\frac{y}{e'} = \frac{K_A K_m}{T_m^2 s^2 + 2\zeta_m T_m s + 1}$$


To improve torque-motor response a loop was closed around it 
and loop gain was adjusted to 1 at 135-deg phase lag. It was 
assumed the differential transformer used to detect torque-motor 
position and the feedback amplifier contributed negligible phase 
lag. 

Thus

$$\frac{y}{e'} = \frac{K_A K_m}{T_m^2 s^2 + 2\zeta_m T_m s + 2}$$


=1 

Pilot Valve. The nozzles are operated at openings of less than 
one quarter their diameters so that the nozzle area is equal to 
the circumference times the nozzle opening. Line losses between 
nozzles 1 and 2 are very small and will be neglected. Drain 
pressure will be assumed to be equal to atmospheric pressure, 
since line losses between the drain and the sump will be neglected 
in this analysis. 

Thus the flow equations are 


= KD,(a— zx) Vp, — p, 
= — y) Vp,.... 


K = 061 ry? 
p 


These relations can be expanded in a Taylor series and if higher 
terms are neglected 
OQ» 
Qn + dp, dp, 
where p,’ and p,’ are deviations of pressure about the mean 
values when Q,; = Q,io. These partial derivatives are variable 


+ Pr 




but for small values of x, p,', p,’ they can be assumed constant 
and the equations can be linearized about some point. 
In this analysis these equations will be linearized about the 


point where 


t= 0,p~,=p,/2 and Qu = Qy 


Qp = Qno — x Ki — p,'K2 + p,’Ks 


Similarly 


Qre = — yKa + p,'Ks... 


Op, 2(p, — B,) 
Op, 
Neglecting spool leakage and compressibility of the entrapped 
oil, both of which have little effect here, the flow-velocity relation
can be written 


= = — p,'Ka + p,'Ks + yKa— p,'Ks | 


/ 


[6] 


Summing forces on the valve spool, neglecting viscous damp- 
ing, friction, the increase in spool inertia due to moving oil and 
flow forces, yields 


M,z 
Equations [6] and [7] can be combined


Kir + Ksp, + Ky — A,z) — p, 


M,i = - 
tm | 2 


but 


Canceling and rearranging yields 
od 
y Tit +2,72+1 


where 


A, 
= ok, Vie: + Ki) K, 


Cylinder. The valve has a slight radial clearance and underlap. 
Referring to the schematic representation of the valve and 
cylinder (Fig. 3) and assuming orifice flow, the flow equations 
can be written 


Qu = KD, + u)? + Ar? VP, — Pe 
KD, V (u— 2)? + Ar? Vp, 
= KD, V (xz + wu)? + Ar Vp, — 
= KD, dr Vp, 
= KD, Ar VD. — Pe 
KD, V(u—2)? + Vp 


} 


where 


K = ory? 
p 


u = underlap
Δr = radial clearance between valve spool and block
In the computer analysis these equations were simplified by
assuming that u = 0 and Δr = 0. For the linear analysis,
Equations [10] were linearized in the region of valve opening: 0.006 <
x < 0.008, and p_c = p_s/2, in the same manner as the pilot-valve
flow equations


Qu = Ker — Kop.’ + Quo | 
Que Quan 


... [10b] 


where in the region of x > u 


| 
K,= — 
Oz | 


+ ») — 


(x + u)? + Ps — Pe 


Now, assuming isothermal compressibility of the fluid on the 
valve side of the piston, the relation between flow and velocity 
can be written, neglecting line losses between the valve and 
cylinder 


Aé = Qn — Qa — Pe.- (11) 


B 


where B = isothermal bulk modulus of the hydraulic oil. 

By definition V = (so + c)A, + Vo. In this analysis, dy- 
namic variations about an equilibrium piston position so will be 
examined where >>c. Thus 


v= 8A, + Vo = WA, 


where ψ is the generalized equilibrium cylinder position.
A force balance on the piston yields 


For static balance 


A 
Pe A, yay =0 


In the linear analysis it is assumed that friction force f, is zero. 
In the nonlinear computer analysis friction force f, is assumed 
equal to 


a 


Equations [10b], [11], and [12] can be combined. Neglecting
constant terms and taking the Laplace transform of the result 
yields 


Ker — 


= 


Thus the cylinder transfer function as used in the linear analysis 
is 

1 

+ + 1) 


.. [5a] 
where 
K, — K,= — = 
| 
= 
A,K; A, 0 M,é = + ......[12] 
Ke+K. 2 
} ux->z>0 
| z£>u 
} 
[10a] where 


Mw M,KeB 


Equations [10a] and [11] can be combined to yield 


2>0, Aé=KDz Ve. 


(15 


Equations [12] and [15] were used to describe cylinder dynamic 
response in the nonlinear computer analysis. 


NUMERICAL VALUES 


In the linear analysis the following were the values of the
constants

$K_t/a = 1830$ volts/in.
$K_A$ = determined by dynamic analysis
$K_m = 7.5 \times 10^{-4}$ in/Δma

$\zeta_m = 0.33$

$T_m = 4.82 \times 10^{-4}$ sec

$T_p = 2.94 \times 10^{-4}$ sec

$\zeta_p = 0.673$

$K_p = 1.555$

$T_c' = 7 \times 10^{-3}$ sec

$T_c = 2.87 \times 10^{-4}$ sec

$\zeta_c = 0.386$


The open-loop frequency response is plotted in Figs. 4 and 5. 
The values for the nonlinear computer analysis were the same 
except in the case of the cylinder. 

The following equations were used to describe the cylinder 
response: 


4ys 
For se = 14.9 x V400 — — py... 1150 


4ys 
r<0, = 149x Vp, — 


For 10° 


s%c = 24.3 p,’ — 1.93 |f,| fA 


EQuaTIONsS 
The following scale factors were chosen 


F, (volts) = f,/5, lb 
X (volts) = 2500 x, in.
Y (volts) = 3000 y, in.
E' (volts) = 5 e', volts
P_c' (volts) = p_c'/10, psi
C (volts) = 10,000 c, in.
R (volts) = 10,000 r, in.
τ (computer time) = 100 t, actual time
Ψ (volts) = 10,000 ψ, in.


Using these scale factors the system equations as set up on the 
computer are 


$$E' = 0.915\,[R - C]$$

$$\frac{Y}{E'} = \frac{0.45\,K_A}{23.2 \times 10^{-4} S^2 + 3.18 \times 10^{-2} S + 2}$$

$$\frac{X}{Y} = \frac{1.29}{8.62 \times 10^{-4} S^2 + 3.96 \times 10^{-2} S + 1}$$




0.188X 0/40 — P, — 0.1 SC 
4x 


O1SP,’ = X20 


0.188X WP, — 0.1Se 


4X 10° 


0.01 = 2.43 — 0.0965 F, [gal 
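
The scaled coefficients of the first three computer equations can be checked from the scale factors and the constants of the linear analysis. The sketch below is illustrative only. It assumes that computer time is 100 times actual time, so that the actual-time operator s becomes 100 S, that the gain of each block is multiplied by the ratio of output to input voltage scale, and that the amplifier gain is left as a separate factor. The function and variable names are assumptions made here.

```python
TIME_SCALE = 100.0  # computer time = 100 x actual time, so s = 100 S

# Voltage scale factors (volts per physical unit) from the list above.
SCALE_X = 2500.0     # valve displacement, per in.
SCALE_Y = 3000.0     # torque-motor displacement, per in.
SCALE_E = 5.0        # transducer signal, per volt
SCALE_CR = 10000.0   # piston position and reference input, per in.


def scaled_gain(physical_gain, out_scale, in_scale):
    """Gain between two scaled computer variables."""
    return physical_gain * out_scale / in_scale


def scaled_quadratic(t_const, zeta):
    """Coefficients of S^2 and S after time scaling of T^2 s^2 + 2 zeta T s."""
    return (t_const * TIME_SCALE) ** 2, 2.0 * zeta * t_const * TIME_SCALE


transducer_gain = scaled_gain(1830.0, SCALE_E, SCALE_CR)
motor_gain_per_unit_ka = scaled_gain(7.5e-4, SCALE_Y, SCALE_E)
motor_s2, motor_s1 = scaled_quadratic(4.82e-4, 0.33)
pilot_gain = scaled_gain(1.555, SCALE_X, SCALE_Y)
pilot_s2, pilot_s1 = scaled_quadratic(2.94e-4, 0.673)

if __name__ == "__main__":
    print(transducer_gain, motor_gain_per_unit_ka, pilot_gain)
    print(motor_s2, motor_s1, pilot_s2, pilot_s1)
```

Under these assumptions the scaled gains and quadratic coefficients agree with the printed computer equations to within rounding.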


Appendix 2 


Force-VELocity CurRvES 


Pressure versus flow curves can be plotted for a given valve ar- 
rangement in order to visualize more easily the region of valve 
operation in which the linearized flow equations apply. An in- 
sight into valve-cylinder operation also can be gained from force- 
velocity curves. The force-velocity curves are proportional to 
the pressure-flow curves. The proportionality factor depends 
on the area of the cylinder. These curves show graphically the 
amount of nonlinearity of the force-velocity relation. 


Fig. 14 Valve-Cylinder Force-Velocity Curves


The advantage of force-velocity diagrams is that design data 
can be read directly from the graph for transient or steady-state 
response. A force ordinate is proportional to an acceleration 
ordinate for a given system. The abscissa gives velocity. Thus 
a phase-plane plot can be directly superimposed on the force- 
velocity curves. The sinusoidal response is shown by an ellipti- 
cal path from which can be read directly the pilot-valve openings 
at given velocities and total forces. 

The force-velocity curves are derived in the following manner: 
Equation [12] can be written as 


f A, — p,/2) 
Neglecting compressibility and combining Equations [10a], 
{11], and [12a] yields 


z>0, Ai = Koay 


2< 0, Ai = + 


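$$\bar{x} > 0, \quad \dot{\bar{c}} = \bar{x}\sqrt{1 - \bar{f}}$$

$$\bar{x} < 0, \quad \dot{\bar{c}} = \bar{x}\sqrt{1 + \bar{f}}$$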


f 
... [2a] | 




where 
A 


i= é 
V 

This relation is plotted in Fig. 14. The first and third quad- 
rants represent useful work done by the piston. The second and 
fourth quadrants represent the piston being driven by the load. 
Since the first and third quadrants are symmetrical, only the first 
quadrant will be considered in the analysis. The cylinder
characteristics are quite nonlinear as the slopes of the force-velocity
lines vary from 0 to ∞ and the change in force due to a change
in x varies from 0 to ∞. However, the cylinder characteristics
can be linearized to yield order-of-magnitude results.

Actually, the valve spool has some radial clearance between it 
and the valve block to provide relief from pressure binding and 
to decrease the bearing surface. Also, the distance between the 
lands of the spool is slightly greater than the distance between 
ports. Considering these clearances a new force-velocity curve 
showing only the first quadrant is plotted in Fig. 9. The only 
effect of these clearances is a slight decrease in velocity and in 
force obtainable for a given valve opening. 
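
The ideal (zero-clearance) force-velocity family can be generated with a short function. This is a sketch under stated assumptions rather than the authors' plotting procedure: velocity, valve opening, and force are taken as dimensionless, with force normalized so that a value of 1 corresponds to the load that raises cylinder pressure to supply pressure, and the branch for negative valve opening is inferred from orifice flow to drain rather than read from the scanned equations. The function name is hypothetical.

```python
import math


def normalized_velocity(x_bar, f_bar):
    """Dimensionless piston velocity for valve opening x_bar and load f_bar.

    x_bar > 0: oil flows from supply into the cylinder, and the available
               pressure drop falls as the opposing load rises.
    x_bar < 0: oil flows from the cylinder to drain, and the pressure drop
               rises with the load (assumed branch).
    """
    term = 1.0 - f_bar if x_bar >= 0.0 else 1.0 + f_bar
    if term < 0.0:
        raise ValueError("load outside the range in which orifice flow is defined")
    return x_bar * math.sqrt(term)


if __name__ == "__main__":
    # One curve of the family per valve opening; forces from 0 to 1.
    for x_bar in (0.2, 0.6, 1.0):
        print(x_bar, [round(normalized_velocity(x_bar, f / 4.0), 3) for f in range(5)])
```

All curves for positive opening converge to zero velocity as the normalized force approaches 1, which is the flattening and convergence noted in the discussion of large constant forces.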


= 2f/p,A, 


% = 


Discussion 


J. L. Shearer.* The emphasis of this paper is on analysis and


* Assistant Professor of Mechanical Engineering, Massachusetts
Institute of Technology, Cambridge, Mass. Mem. ASME.




analytical evaluations rather than on the results of detailed ex- 
perimental work. It does appear that additional analysis of the 
system that was built and tested would be a valuable contribution 
to the work in this field. However, based on the analytical work 
alone, this paper makes a real contribution to the field of hy- 
draulic servomechanisms because the analysis does reveal a good 
quantitative picture of the many problems involved in the design 
and development of fast reliable hydraulic servomechanisms. 
It is interesting to note that the major lag in this system occurs 
in the torque motor and associated electronic amplifier and that 
this is the most severe limitation to attaining a really fast re- 
sponse. It would appear that if a faster response is required for a 
system of this kind, then a major effort would be required in im- 
proving further the dynamic response of the electromechanical 
transducer (i.e., torque motor). It is especially true in machine- 
tool control drives, where coulomb friction and similar types of 
nonlinear friction occur in the sliding ways and bearings of the 
machine, that a ramp input is a very important type of input to 
use during an analog study. The reason for this is that with a 
ramp input the output velocity has a steady component and, if 
this steady component is high enough, variations that occur due 
to an oscillatory tendency or load changes will not cause the vel- 
ocity to decrease to zero and change sign. The result is that dur- 
ing a ramp input the coulomb friction, which depends largely on 
the sign of the velocity and very little on the magnitude of the 
velocity, does not provide the stabilizing effect that it does when 
frequency-response or transient-response measurements are made. 




Nonlinear Analog Study of a High-Pressure Pneumatic Servomechanism¹


By J. L. SHEARER,² CAMBRIDGE, MASS.


A detailed analog simulation was made in order to 
evaluate the effects of nonlinear valve characteristics, 
nonlinear ram-chamber compliance, and coulomb fric- 
tion in the ram on the dynamic performance of a high- 
pressure pneumatic servomechanism that had been stud- 
ied previously with a linearized analysis. This analog 
study revealed that a major part of the discrepancy be- 
tween measured frequency-response characteristics and 
the frequency-response characteristics computed from a 
linearized analysis is caused by coulomb friction in the 
ram. Although the system is decidedly unsymmetrical 
when the ram moves near one end of its cylinder, the over- 
all dynamic characteristics are nearly the same as when 
the ram moves near its center position. Thus, a simple 
linearized analysis of ram-chamber characteristics, which 
is possible only when the ram is near its center position, is 
applicable throughout most of the operating range of the 
ram. The nonlinear characteristics of the control valve 
did not seem to be significant in the problem because large 
pressure differences across the ram never were required to 
drive the mass load. 


NOMENCLATURE 


The following nomenclature is used in the paper: 


A = ram area, sq in.

capillary-resistance coefficient, in'/lb-sec 

$C_p$ = specific heat at constant pressure, for air: 8.65 × 10⁵
sq in/sec², deg R

$C_v$ = specific heat at constant volume, for air: 6.18 × 10⁵
sq in/sec², deg R

D = derivative with respect to time, d/dt, sec⁻¹

d/dt = derivative with respect to time, sec⁻¹

$e_1$ = first input signal, volts

$e_2$ = second input signal, volts

$e_3$ = output signal, volts

$e_g$ = input (grid) voltage to high-gain amplifier, volts

F, = coulomb-friction force, lb 

F, = limiting (maximum) value of F,, lb 

F, = ram pressure force, lb 

jf, = functional relationship between valve flow W, valve 
position X, and load pressure P 

g = acceleration due to gravity, 386 in/sec²

k = ratio of specific heats, C,/C,, for air: 1.4 

k, = amplifier gain, volts/volt 

k, = feedback taper, in/in. 


1 The work reported in this paper was carried out at the Massachu- 
setts Institute of Technology as part of an ScD thesis investigation 
entitled ‘Continuous Control of Motion With Compressed Air.” 

2 Assistant Professor of Mechanical Engineering, Massachusetts 
Institute of Technology. Mem. ASME. 

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, December
28, 1955. Paper No. 56-IRD-1.


$k_i$ = input taper, in/in.
k, = over-all gain of electromechanical servomechanism and 
potentiometer, dR,/de2, ohms/volt 

L = external load force, lb 

m = load mass, lb-sec²/in.

P = absolute pressure, psi

R = gas constant, for air: 2.47 × 10⁵ sq in/sec², deg R
R, = potentiometer resistance, ohms 
R, = feedback resistance, ohms 

T = temperature, deg R 

t = time, sec 

V = volume, cu in. 

W = weight rate of flow, lb/sec 

X = valve position, in., zero when $W_a = W_b = 0$ and $P_a = P_b$
Y = ram position, in., zero when $V_a = V_b$

Z = position of input taper, in.

φ = phase shift between input and output, radians

ω = frequency, radian/sec

Frequently used subscripts:

a = a end of system

b = b end of system

e = exhaust

i = initial value (except in $k_i$)

s = supply 

t = stabilizing tank 


INTRODUCTION 


Hydraulic fluids have been used widely as the working medium 
in systems developed to control the motion of mass loads. Recent 
development of pneumatic control systems³,⁴ has demonstrated
that compressed gases can be used effectively for the continuous 
control of motion of mass loads. 

The schematic diagram, Fig. 1, shows a positioning servo- 
mechanism employing a valve-controlled pneumatic servomotor 
to drive a mass load and external load force with a simple feed- 
back mechanism to sense output position Y and to stroke the 
valve in such a manner that the pneumatic servomotor will 
attempt to provide changes in Y that are proportional to a low- 
energy-level input motion Z. Earlier papers³ and a thesis
investigation⁵ by the author discuss the pneumatic processes
involved in the operation of a system like that of Fig. 1, and a
linearized analysis was employed to demonstrate how stabilizing 
resistances and stabilizing tanks could be used effectively to pro- 
vide damping in the servomotor. 

When the complete servomechanism was studied experimen- 
tally in the laboratory by means of frequency-response measure- 
ments, the output motion was not sinusoidal at all frequencies and 
the measured amplitude and phase characteristics were somewhat 
different from those computed from a linearized analysis. In 


3 "Study of Pneumatic Processes in the Continuous Control of
Motion With Compressed Air, Parts I and II," by J. L. Shearer,
Trans. ASME, vol. 78, 1956, pp. 233-249.

4 "Tie Simplicity to Power With Pneumatic Servomechanisms,"
by H. Levenstein, Control Engineering, vol. 2, June, 1955, pp. 65-70.

5 "Continuous Control of Motion With Compressed Air," by J. L.
Shearer, ScD thesis, Department of Mechanical Engineering,
Massachusetts Institute of Technology, Cambridge, Mass., 1954.


Fig. 1 Schematic Diagram of Pneumatic Positioning Servomechanism

Fig. 2 Sketches of Input and Output Wave Forms
((a) Input Z and output Y as functions of time. (b) Output Y as function
of sinusoidally varying input Z.)

particular, the ram came to rest when its direction of motion 
changed in the way shown qualitatively in Fig. 2(a) when the 
input was varied sinusoidally. This dwell in output motion, 




which was observed with a fairly wide range of input frequencies, 
appears more distinctly when the output is plotted against the 
input sinusoid, as shown in Fig. 2(b). A small amount of coulomb 
friction was found from static measurements of the ram, and the 
dwell in output motion tentatively was attributed to this coulomb 
friction. It was apparent, however, that either an exhaustive 
experimental study or a detailed analog study would be re- 
quired to show quantitatively how the various known nonlineari- 
ties in the system contributed to system performance. Estimates 
indicated that a thorough experimental study, in which many 
important system parameters were changed, would be much more 
costly than an analog study. Although a digital computer was 
employed in a check solution intended to insure the validity of 
the analog results, the long computing time and limited infor- 
mation-storage facilities then available made a thorough digital- 
computer study inadvisable. All of the required analog-com- 
puter components were available in the Generalized Computer 
of the Dynamic Analysis and Control Laboratory (D.A.C.L.) of 
the Massachusetts Institute of Technology. 


BASIC EQUATIONS AND FUNCTIONAL RELATIONSHIPS


Valve. The characteristics of the control valve are discussed in
detail,³,⁵ and Fig. 3 is a graphical representation of the relationship
between the weight rate of flow toward one end of the ram $W_a$,
valve position $X$, and ram-chamber pressure $P_a$. The flow toward
the other end of the ram $W_b$ is related to valve position $X$
and the other ram-chamber pressure $P_b$ by an identical set of
curves with only the sign of the valve position changed. In
other words

$$W_a = f_1(X, P_a) \qquad [1]$$

$$W_b = f_1(-X, P_b) \qquad [2]$$

where $f_1$ denotes the functional relationship given by the curves.

Fig. 3 Measured Pressure-Flow Characteristics of Nominal
Closed-Center Sliding-Plate Valve
[Load flow, lb/sec × 100, versus ram-chamber pressure in psi gage.
Supply pressure = 800 psig; supply temperature = 530 R;
exhaust pressure = 0 psig.]


Fig. 4 Schematic Diagram Showing Important Variables in
Pneumatic System

Ram Chambers. Applying the energy equation to the ram
chambers, as shown in Fig. 4, gives

$$(W_a - W_{ta})\,T_s - \frac{g P_a}{C_p}\,\frac{dV_a}{dt} = \frac{g}{kR}\,\frac{d}{dt}(P_a V_a) \qquad [3]$$

$$(W_b - W_{tb})\,T_s - \frac{g P_b}{C_p}\,\frac{dV_b}{dt} = \frac{g}{kR}\,\frac{d}{dt}(P_b V_b) \qquad [4]$$

where
$W_{ta}$ = weight rate of flow into tank at a end, lb/sec
$W_{tb}$ = weight rate of flow into tank at b end, lb/sec
$T_s$ = temperature of gas supply, deg R
$g$ = acceleration due to gravity, 386 in/sec²
$C_p$ = specific heat of gas at constant pressure, sq in/sec²-deg R
$V_a$ = volume of chamber in a end of ram, cu in.
$V_b$ = volume of chamber in b end of ram, cu in.
$t$ = time, sec
$k$ = ratio of specific heat at constant pressure to specific
heat at constant volume, for air: 1.4
$R$ = gas constant, for air: 2.47 × 10⁵ in²/sec²-deg R


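Stabilizing Resistances. The flow through the stabilizing
resistances is obtained by using the expression for flow of a gas
through a capillary resistance

$$W_{ta} = \frac{g C_c}{2 R T_s}\,(P_a^2 - P_{ta}^2) \qquad [5]$$

$$W_{tb} = \frac{g C_c}{2 R T_s}\,(P_b^2 - P_{tb}^2) \qquad [6]$$

where
$C_c$ = capillary-resistance coefficient, in⁵/lb-sec
$P_{ta}$ = pressure in tank at a end, psi
$P_{tb}$ = pressure in tank at b end, psi

Reasonably good approximations for Equations [5] and [6] are
given by

$$W_{ta} = \frac{g C_c}{R T_s}\,P_a\,(P_a - P_{ta}) \qquad [7]$$

$$W_{tb} = \frac{g C_c}{R T_s}\,P_b\,(P_b - P_{tb}) \qquad [8]$$

Stabilizing Tanks. Applying the energy equation to the
stabilizing tanks gives

$$W_{ta} T_s = \frac{g V_t}{kR}\,\frac{d}{dt}(P_{ta}) \qquad [9]$$

$$W_{tb} T_s = \frac{g V_t}{kR}\,\frac{d}{dt}(P_{tb}) \qquad [10]$$

where $V_t$ is the volume of each stabilizing tank, cu in.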
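
As a rough numerical check on the kind of approximation mentioned above for Equations [5] and [6], the sketch below compares a difference-of-squares pressure term with a product form that a single analog multiplier could generate (the multiplications listed for Fig. 6 suggest such a product). The difference-of-squares form, the function names, and the sample pressures are assumptions made for illustration; they are not values taken from the paper.

```python
def exact_term(p_ram, p_tank):
    """Difference of squared pressures (assumed capillary-flow form)."""
    return p_ram ** 2 - p_tank ** 2


def product_term(p_ram, p_tank):
    """Product form 2*P*(P - Pt), obtainable with one multiplier."""
    return 2.0 * p_ram * (p_ram - p_tank)


def relative_error(p_ram, p_tank):
    """Fractional error of the product form relative to the exact term."""
    exact = exact_term(p_ram, p_tank)
    return (product_term(p_ram, p_tank) - exact) / exact


if __name__ == "__main__":
    # Hypothetical ram pressure of 400 psi with several pressure drops.
    for drop in (1.0, 10.0, 50.0):
        print(drop, relative_error(400.0, 400.0 - drop))
```

Algebraically the fractional error equals the pressure drop divided by the sum of the two pressures, so under these assumptions it stays small whenever the drop across the resistance is small compared with the pressure level.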


Ram and Mass Load. Application of Newton’s second law to
the ram and mass load yields

$$(P_a - P_b)\,A = m\,\frac{d^2Y}{dt^2} + F_c + L \qquad [11]$$

where
$A$ = ram area, sq in.
$m$ = load mass, lb sec²/in.
$Y$ = ram position, in., zero when ram is in center of cylinder
$L$ = external load force, lb
$F_c$ = coulomb-friction force, lb

Ram Volumes. The ram volumes $V_a$ and $V_b$ are related to $Y$ by

$$V_a = V_i + AY \qquad [12]$$

$$V_b = V_i - AY \qquad [13]$$

where $Y$ is measured from the point where $V_a = V_b = V_i$.

Feedback. The valve motion $X$ is related to the input motion $Z$
and the output motion $Y$ by

$$X = k_i Z - k_f Y \qquad [14]$$

where
$k_i$ = input taper, in/in.
$Z$ = position of input taper, in.
$k_f$ = feedback taper, in/in.

Coulomb Friction. The coulomb-friction force is constant
(independent of ram velocity) when the ram is moving, and when the
ram is motionless, this friction force is equal to the sum of all other
forces acting on the ram

$$F_c = F_m\,\frac{dY/dt}{\left|dY/dt\right|}, \qquad \left|\frac{dY}{dt}\right| > 0 \qquad [15a]$$

$$F_c = (P_a - P_b)\,A - L, \qquad \left|\frac{dY}{dt}\right| = 0 \qquad [15b]$$

where
$F_m$ = limiting (maximum) value of $F_c$, lb
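
The friction rule described above can be sketched in a few lines. This is a minimal illustration and not the analog circuit itself (which used a high-gain amplifier and precision limiter); the function name, the argument names, and the small velocity threshold `v_eps` used to decide when the ram counts as motionless are all assumptions.

```python
def coulomb_friction(velocity, other_forces, f_max, v_eps=1e-9):
    """Friction force opposing the ram, in lb.

    velocity      ram velocity dY/dt, in/sec
    other_forces  sum of all other forces acting on the ram, lb
    f_max         limiting (maximum) friction force, lb
    v_eps         hypothetical threshold below which the ram is "at rest"
    """
    if abs(velocity) > v_eps:
        # Moving: constant magnitude, directed against the motion.
        return f_max if velocity > 0 else -f_max
    # Motionless: friction balances the other forces, up to the limit.
    return max(-f_max, min(f_max, other_forces))


if __name__ == "__main__":
    print(coulomb_friction(1.0, 50.0, 13.0))   # ram moving
    print(coulomb_friction(0.0, 5.0, 13.0))    # ram held at rest
    print(coulomb_friction(0.0, -20.0, 13.0))  # ram about to break away
```

The net accelerating force is then `other_forces - coulomb_friction(...)`, which is zero while the ram is held and nonzero once the other forces exceed the limit.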


DESCRIPTION OF ANALOG SYSTEM


Block Diagram of Complete System. The block diagram shown 
in Fig. 5 represents the complete set of equations describing the 
system and gives an over-all picture of the operation of the sys- 
tem. It also reveals the role played by each physical charac- 
teristic of the system and the many interactions within the sys- 
tem. 

Two of the D.A.C.L. function generators were employed to
simulate the valve characteristics. Fig. 6 is a view of one
three-dimensional profile and reading head that was used to generate
the valve flow as a function of valve position and ram pressure.
Position-type electromechanical servomechanisms provide means
of introducing a valve-displacement signal $X$ and a ram-pressure
signal $P_a$ by positioning the reading head along the cross carriage
and by positioning the cross carriage along the machine frame,
respectively. A linear potentiometer in the reading head delivers
a signal proportional to the height of the three-dimensional profile,
which represents flow toward the ram. Also shown is a
constant-slope profile and accompanying reading head clamped to the
cross carriage. This reading head contains two potentiometers:
one is excited with a voltage representing $dV_a/dt$, and the other
is excited with a voltage representing $(P_a - P_{ta})$. Thus the
voltages picked up by the two potentiometer wipers represent $P_a$
times $dV_a/dt$ and $P_a$ times $(P_a - P_{ta})$, respectively. A similar
arrangement simulates the situation at the other end of the ram
and valve.

Fig. 5 Block Diagram of Complete Pneumatic Servomechanism
[M denotes multiplier; DIV denotes divider; FG denotes function
generator; 1/D denotes integration with respect to time;
Σ denotes summation. The diagram includes a coulomb-friction simulator.]

Fig. 6 View of Function Generator Setup to Simulate a End
of Control Valve and to Perform Multiplications
[$P_a \times dV_a/dt$ and $P_a \times (P_a - P_{ta})$.]

Each of the quotients P,V,/V, and —P,P,/V, is obtained 
with an electromechanical dividing system like that shown in 
Fig. 7. 


Fig. 7 Schematic Diagram of Electromechanical System
Used to Divide $e_1$ by $e_2$
[The output $e_3$ is proportional to $-e_1/e_2$; the divider is built around
an electromechanical positioning servomechanism.]


Fig. 8 Over-All View of Analog-Computer Equipment

Conventional summing, coefficient, and integrating circuits are 
used with chopper-stabilized d-c amplifiers to achieve these 
necessary operations. Coulomb friction was simulated with a 
high-gain amplifier and precision limiter. Fig. 8 is a view of the 
complete analog setup. Each system variable is represented by 
an electric signal having a voltage proportional to the value of the 
variable. The time scale used in the greater part of this study was 
8 to 1; in other words, the integrators were 8 times slower than 
real-time integration. 
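
To make the 8-to-1 time scale concrete, the following sketch converts between problem (real-system) quantities and the corresponding quantities on the slowed-down analog. The function names are hypothetical; only the scale factor comes from the text.

```python
TIME_SCALE = 8.0  # machine seconds per problem second (from the text)


def machine_time(problem_time_sec, scale=TIME_SCALE):
    """Time taken on the analog to cover a given span of problem time."""
    return problem_time_sec * scale


def machine_frequency(problem_freq_cps, scale=TIME_SCALE):
    """Frequency seen on the analog for a given problem frequency."""
    return problem_freq_cps / scale


if __name__ == "__main__":
    # A 1.0-cps test of the real system runs at 0.125 cps on the analog.
    print(machine_frequency(1.0), machine_time(1.0))
```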


RESULTS OF DYNAMIC SYSTEM STUDIES


The system studied in greatest detail with the analog was the
one which had been measured in the laboratory. The control-valve
characteristics are shown in Fig. 3; supply pressure $P_s$ was
800 psig; supply temperature $T_s$ was 530 R; half ram volume $V_i$
was 30 cu in.; ram area $A$ was 4.34 in.²; stabilizing tank volume
$V_t$ was 130 cu in.; stabilizing-resistance coefficient $C_c$ was 4.46
in⁵/lb-sec; maximum coulomb friction $F_m$ was 13.0 lb; load mass
$m$ weighed 70 lb; feedback taper $k_f$ was 0.022 in/in.; and input
taper $k_i$ was 0.25 in/in. The result of an analog simulation is
compared in Fig. 9 to the measured response of the experimental
system when the input Z was varying sinusoidally with an
amplitude of ±0.050 in.
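
Two derived quantities follow directly from the parameters listed above and may help in reading the results. The sketch below converts the load weight to mass in inch-pound-second units and estimates the low-frequency output amplitude, assuming that at very low frequency the valve returns to its centered position (X = 0) so that the feedback relation alone fixes Y. That centering assumption and the function names are illustrative and are not statements from the paper.

```python
G = 386.0            # acceleration due to gravity, in/sec^2
LOAD_WEIGHT = 70.0   # lb
K_I = 0.25           # input taper, in/in.
K_F = 0.022          # feedback taper, in/in.
Z_AMPLITUDE = 0.050  # input amplitude, in.


def load_mass(weight_lb, g=G):
    """Mass in lb sec^2/in. for a load of the given weight."""
    return weight_lb / g


def static_output(z, k_i=K_I, k_f=K_F):
    """Output Y for which the valve is centered: 0 = k_i*Z - k_f*Y."""
    return k_i * z / k_f


if __name__ == "__main__":
    print(load_mass(LOAD_WEIGHT))      # about 0.181 lb sec^2/in.
    print(static_output(Z_AMPLITUDE))  # about 0.568 in.
```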

Several series of steady-state frequency-response tests were
simulated on the analog in order to determine differences between
the analog system and the real system, and to observe
the effects of varying the input amplitude, coulomb friction in the
ram, and the operating position of the ram. Fig. 10 illustrates two
of the series of tests that were simulated on the analog with the
ram operating near its center position. The series of Fig. 10(a) was
made with the measured coulomb friction set into the analog,
and the series of Fig. 10(b) was made with zero coulomb friction
set into the analog. This shows clearly that the dwell in output
motion, which was discussed earlier, is caused by coulomb friction.
Each series required only a few minutes to run off. Other tests
that were simulated on the analog included frequency-response
measurements with the coulomb friction doubled and with the
input amplitude reduced from ±0.050 in. to ±0.025 in. The results
of all of these tests appear in Fig. 11, where the response computed
from a linearized analysis³,⁵ is given also for purposes of comparison.
The results from tests of the experimental system are
shown in Fig. 12. The greatest discrepancy between the analog
simulation and the experimental system itself appears in the
frequency range of 3 to 5 cps.

Fig. 9 Comparison of Frequency-Response Test of Experimental
System With Simulated Test on Analog
[(a) Y versus Z of analog at 1.0 cps. (b) Y versus Z of experimental
system at 1.0 cps.]

Fig. 10 Two Series of Tests Simulated With Analog Computer
[(a) With coulomb friction. (b) Without coulomb friction.]

Series of frequency-response tests that were simulated for the
ram operating near positions 2 in., 4 in., and 6 in. from its center
position (maximum stroke = ±6.5 in.) did not vary perceptibly
from those made near the center position. It was observed, however,
that during operation near one end of the ram cylinder, the
major pressure variation occurred in the smaller ram chamber
and attached stabilizing tank, an effect demonstrating the need
for two tanks and resistances. The magnitude of the pressure
variations was always less than 100 psi, with the result that the 
control valve operated in a nearly linear fashion throughout the 
tests. Reasonably good agreement was obtained between pres- 
sure variations in the real system and pressure variations in the 
analog. 


CONCLUSIONS 


Although the linearized analysis does not predict actual system 
performance accurately, it does provide an excellent means of 
gaining a qualitative insight into the dynamic characteristics of 
the system, especially in regard to the effectiveness of the stabilizing
tanks and resistances. A fact predicted by the linearized
analysis is that this system is very unstable when the stabilizers 
are shut off. Both the real system and the analog demonstrated 
this fact very dramatically. 

The analog proved to be a highly effective tool in attaining
optimum designs for a system of this kind having different sets of
requirements. A number of such designs were completed in the
space of a few hours after the analog was set up. The cost in
time, money, and effort of trying various modifications of the
real system would have been many times greater.

Fig. 11 Results of Several Series of Steady-State Frequency-Response
Tests Simulated With Analog Computer Compared With Response
Computed From Linearized Analysis
[(a) Amplitude versus frequency. (b) Phase angle versus frequency.
Abscissa: frequency in cps. Curves: computed from linearized analysis;
simulator, no friction, ±0.050-in. input; simulator, twice-measured
coulomb friction, ±0.050-in. input; simulator, measured coulomb
friction, ±0.050-in. input; simulator, measured coulomb friction,
±0.025-in. input.]

Fig. 12 Results of Two Series of Steady-State Frequency-Response
Tests on Experimental System Compared With Response Computed
From Linearized Analysis
[(a) Amplitude versus frequency. (b) Phase angle versus frequency.
Abscissa: frequency in cps. Curves: computed from linearized analysis;
measured with ±0.050-in. input; measured with ±0.025-in. input.]


ACKNOWLEDGMENTS


The author is indebted to many members of the Dynamic
Analysis and Control Laboratory for the assistance and the
counsel given in carrying out the work reported here.
Outstanding among them are: H. Mori (analog setup and
operation), C. W. Gould (experimental work), J. M. Aitken
(drafting), and Prof. J. A. Hrones, Director of the Laboratory.

The work described here was supported in part by the Bureau 
of Ordnance, U. S. Navy Department, under Contract No.
NOrd 11799, with the Division of Industrial Cooperation, Massa- 
chusetts Institute of Technology. 



Discussion 


C. Muzzey.⁶ The author’s work on gas-driven servo systems
is extremely interesting and he is to be commended for producing 
a unit which can control a sizable mass load at a very respectable 
frequency response. The paper presents a good picture of the 
analog computer setup used to study and design the system. 

It would seem that the computer setup used would be capable 
of handling somewhat larger input signals than those reported, and 
that some transient responses could be obtained and compared 
with tests on the actual servo. On the other hand, restriction 
to small inputs probably would permit further simplification of 
the analog, particularly with respect to valve characteristics. 

The penalty in equipment size in using a pneumatic rather 
than hydraulic working fluid is considerable. For example, if 
the author’s servo ram has a half-stroke displacement of 30 
cu in. and a maximum half stroke of 6.5 in., the piston area is 
4.61 sq in. to control a 70-lb mass load. Two stabilizing tanks of 
considerable size also are required to make the gas servo behave. 
Recently we designed a hydraulic servo for a somewhat higher 
frequency response with a piston area of 0.75 sq in. for a 33-lb 
mass load. Thus, when conditions of ambient temperatures 
or fluid power supply dictate the use of a gas servo, the designer 
must be prepared to find space for a device which will be large 
when judged by familiar standards of hydraulic equipment. 
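
The discusser's size comparison rests on simple arithmetic, reproduced below as a sketch. The piston area follows from the quoted half-stroke displacement and half stroke; the "area per pound of load" figure is an illustrative metric introduced here, not one used in the discussion, and the function names are hypothetical.

```python
def piston_area(half_stroke_displacement_cu_in, half_stroke_in):
    """Piston area implied by displacement and stroke, sq in."""
    return half_stroke_displacement_cu_in / half_stroke_in


def area_per_pound(area_sq_in, load_lb):
    """Piston area per pound of mass load, sq in./lb."""
    return area_sq_in / load_lb


# Figures quoted in the discussion.
pneumatic_area = piston_area(30.0, 6.5)                  # sq in.
pneumatic_ratio = area_per_pound(pneumatic_area, 70.0)   # gas servo
hydraulic_ratio = area_per_pound(0.75, 33.0)             # hydraulic servo

if __name__ == "__main__":
    print(pneumatic_area)
    print(pneumatic_ratio / hydraulic_ratio)
```

On these figures the gas servo uses roughly three times as much piston area per pound of load, before the stabilizing tanks are counted.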


D. V. Stallard.⁷ This paper presents a lucid and penetrating
study of a nonlinear servomechanism that is inherently complex 
and difficult to understand. 

By way of comparison, friction appears to be a more serious 
problem in a pneumatic ram servo than in a similar hydraulic 
servo, because of the much lower lubricity of air and the fact 
that it takes appreciable air flow and time to build up the break- 
away differential pressure across the ram. In the experimental 
pneumatic system, it appears that the ram area was 4.6 sq in. 
Therefore the differential pressure necessary to overcome the 
13-lb coulomb friction was only 2.8 psi. If the system fluid had 
been oil at 800 psig instead of air, then the 13-lb coulomb friction 
would not have caused a noticeable dwell in the output motion 
during reversal. 
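
The breakaway pressure quoted above is the friction force divided by the ram area. The sketch below repeats that division and, as a rough illustration only, expresses the result as a fraction of the 800-psig supply; the function name and the comparison against the supply gage pressure are assumptions made here.

```python
def breakaway_pressure(friction_lb, ram_area_sq_in):
    """Differential pressure needed to overcome friction, psi."""
    return friction_lb / ram_area_sq_in


# Figures quoted in the discussion: 13-lb friction, 4.6-sq-in. ram.
differential_psi = breakaway_pressure(13.0, 4.6)
fraction_of_supply = differential_psi / 800.0

if __name__ == "__main__":
    print(differential_psi)    # about 2.8 psi
    print(fraction_of_supply)  # well under 1 per cent of supply
```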

Because friction lowers the static accuracy, many hydraulic 
servos of the valve or transmission type have utilized a dither 
signal of low amplitude and high frequency to keep the friction- 
loaded member broken loose. Perhaps a pneumatic servo would 
have a lower static error if its valve were dithered enough to 
keep the alternating ram pressure difference just lower than the 
breakaway pressure. 

In some control systems with proportional-plus-integral
compensation, coulomb friction actually causes a nonlinear oscillation
at a low amplitude and low frequency. For example, Haas⁸
has described an oscillating hydraulic servo on a machine tool 
with considerable load friction. In his analog computer 
study, Haas simulated a breakaway friction force which was 1.3 
times the sliding friction force. The writer suspects that the 
author’s pneumatic servo might exhibit a similar nonlinear oscilla- 
tion at a low frequency if proportional-plus-integral compensa- 
tion were added. 


6 Flight Research Department, Cornell Aeronautical Laboratory, 
Inc., Cornell University, Buffalo, N. Y. 

7 Research Engineer, Servomechanisms Laboratory, Massachusetts 
Institute of Technology, Cambridge, Mass. 

8 “Coulomb Friction in Feedback Control Systems,” by V. B. Haas,
Jr., AIEE Transactions Paper No. 53-108, Applications and Industry,
vol. 72, May, 1953, pp. 119-126.




In making a sinusoidal frequency-response test, special care 
must be taken when the output wave form is nonsinusoidal or 
noisy; otherwise the phase data may be very inaccurate. One 
good method⁹ is to form a null Lissajous pattern on an oscilloscope
with the output wave form and a sine wave which is shifted
from the input wave by a known, adjustable phase angle. In this 
way, the phase angle of the fundamental frequency in the non- 
sinusoidal output wave form may be found with comparative 
accuracy and ease. By contrast, it is often misleading to measure 
the phase angle between zero crossings. 
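
A numerical counterpart of this idea, offered only as a sketch, is to correlate one sampled period of the output with a reference sine and cosine at the input frequency; harmonics then drop out and only the phase of the fundamental remains. This is not the oscilloscope method itself. The test wave form, its 30-deg lag, and its third-harmonic content are invented for illustration.

```python
import math


def fundamental_phase_deg(samples):
    """Phase of the fundamental, in degrees, relative to a reference sine.

    `samples` must cover exactly one period of the input sinusoid.
    A negative result means the output lags the input.
    """
    n = len(samples)
    in_phase = sum(y * math.sin(2.0 * math.pi * i / n)
                   for i, y in enumerate(samples))
    quadrature = sum(y * math.cos(2.0 * math.pi * i / n)
                     for i, y in enumerate(samples))
    return math.degrees(math.atan2(quadrature, in_phase))


def distorted_output(n=360, lag_deg=30.0):
    """Hypothetical nonsinusoidal output: lagging fundamental plus 3rd harmonic."""
    lag = math.radians(lag_deg)
    wave = []
    for i in range(n):
        theta = 2.0 * math.pi * i / n
        wave.append(math.sin(theta - lag) + 0.3 * math.sin(3.0 * theta + 1.0))
    return wave


if __name__ == "__main__":
    print(fundamental_phase_deg(distorted_output()))
```

Because the third harmonic shifts the zero crossings of this wave form, a zero-crossing measurement would generally not return the lag of the fundamental, which is the discusser's point.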

The analog instrumentation is remarkably thorough. How- 
ever, it might have been desirable to eliminate the function 
generators which simulated the valve characteristics. Since the 
pressure variations were always less than 100 psi, the operation 
of the control valve was very nearly linear. 

It appears preferable to compare the data of the experimental 
system directly with the corresponding simulator data rather 
than with the results of the linearized analysis. When replotted, 
the experimental-system data agreed reasonably well with those 
of the simulator, except in the high-frequency region, which is 
not very important anyway. Evidently the effect of increasing 
the input amplitude is to make the experimental system behave 
more like the linearized approximation, perhaps because the 
force necessary to accelerate the ram increases with amplitude.

It is quite probable that this and previous papers by the 
author will advance the state of the pneumatic art, which appears 
to have been retarded by a lack of understanding of the physical 
processes. 


AUTHOR’S CLOSURE


Mr. Muzzey’s comments regarding larger input signals and 
transient-response measurements are well taken. Much of the 
work that he suggests was actually performed as part of 
the author’s doctoral thesis investigation.⁵ However, careful
transient-response measurements of the experimental system 
were not made because of limitations of time, effort, and facilities. 
It was found that frequency-response measurements with larger- 
amplitude inputs were rendered rather meaningless at frequencies 
above 3 or 4 cps due to flow limiting in the valve. The transient- 
response measurements that were made on the nonlinear analog 
were very interesting. In addition to step inputs, ramp-type 
inputs also were used to study stability during conditions when 
the direction of ram motion does not change; that is, when there 
are no beneficial damping effects from coulomb friction. 

The simplification of the analog for small input signals was 
carried out earlier in the author’s papers on pneumatic processes,³
except that coulomb friction was not included in this earlier
analog work. On the basis of the results from the non- 
linear analog study, it is possible to predict that an analog 
which is linear except for coulomb friction should provide an 
effective simulation of this system. 

The penalty in equipment size mentioned by Mr. Muzzey 
certainly exists in some instances. In other instances, however, 
the penalty in size is negligible, depending on the size of the 
mass load to be driven and depending on speed-of-response 
requirements. 

The comments offered by Mr. Stallard are pertinent and well 
made and no additional rebuttal seems necessary. 


9 “Principles of Servomechanisms,” by G. S. Brown and D. P.
Campbell, John Wiley & Sons, Inc., New York, N. Y., 1948, pp.
317-320.


A Dual-Mode Damper-Stabilized Servo 


By J. JURSIK,¹ J. F. KAISER,² AND J. E. WARD³


The highly oscillatory response of an inertia-damper 
stabilized servo to large step inputs can be improved 
greatly by changing, for large error, the coupling torque 
between the damper and the drive motor. This paper pre- 
sents the results obtained from an experimental model and 
an analog computer study of the dual-mode damper- 
stabilized servo. For large step inputs, this dual-mode 
method of operation resulted in a reduction of settling 
time by a factor of 5.8 and reduction of peak overshoot by 
a factor of 200 as compared with single-mode operation. 


INTRODUCTION 

Instrument servos employing inertia-damper stabilization⁴
have been used extensively in recent years where
high performance is required. This type of servo stabilization
has several advantages; i.e., very high velocity and torque
constants may be obtained along with extremely smooth and 
reliable operation; the servo compensation is not sensitive to 
changes in the a-c supply frequency; and because the amplifier can 
be a conventional a-c type, a system free of zero drift can be con- 
structed. An inertia damper consists of an inertia slug coupled 
viscously to the servo-motor shaft, usually by means of a vis- 
cous fluid or a magnetic eddy-current coupling. 

The major disadvantage of inertia-damper stabilization has 
been that system transient response deteriorates rapidly for in- 
puts which cause the system error to exceed the linear range, 
which is quite small because of the high static system gain. This 
deterioration is brought about primarily by energy storage in the 
damper inertia during periods of nonlinear operation. Although 
a damper-stabilized repeater can be designed so that it always 
operates in the linear range during normal follow-up operation, 
the transient following any large abrupt change in input position 
which causes the repeater to slew is usually quite severe, and may 
limit the usefulness of the repeater in certain applications. 

The purpose of this paper is to show how the large-signal per- 
formance of an instrument servomechanism employing inertia- 
damper stabilization can be improved by altering the coupling 
between the stabilizing damper and the system. This coupling 
can be either a simple slip clutch, or an electrically operated clutch 
controlled by a function of the servo error. The error-actuated 
clutch, although more complicated, has been found to be superior 
because of greater reliability. Its physical design and operation 
will be described in the remainder of the paper. 


DESCRIPTION OF DUAL-MODE SERVO
Physical Characteristics. A cross-sectional view of the dual-mode
damper package attached to a servomotor is shown in Fig. 1.
The damper consists of a sintered-tungsten slug mounted on bearings
within a spun aluminum case filled with a silicone damping
fluid. The aluminum damper case is supported by two bearings;
one fixed in an extension of the motor housing and one running
on the motor shaft. The clutch is composed of a small collet
attached to the damper case, which can be expanded into a drum
attached to the motor shaft.

Fig. 1 Cross Section
[Labeled parts: diaphragm, solenoid, armature, coil, heater,
thermostat, damper, clutch, motor.]

1 Engineer, Clevite Research Center, Cleveland, Ohio.

2 Research Assistant, Servomechanisms Laboratory, Department
of Electrical Engineering, Massachusetts Institute of Technology,
Cambridge, Mass.

3 Executive Officer, Servomechanisms Laboratory, Department of
Electrical Engineering, Massachusetts Institute of Technology,
Cambridge, Mass.

4 “Damper-Stabilized Instrument Servomechanisms,” by Albert C.
Hall, AIEE Paper 49-79, December, 1948.

Contributed by the Instruments and Regulators Division of The
American Society of Mechanical Engineers and presented at
the ASME-AIEE conference on Nonlinear Control Systems,
Princeton, N. J., March 26-28, 1956.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
11, 1956. Paper No. 56-IRD-6.

The clutch normally is kept engaged by a spring which acts on 
a push rod so as to expand the collet. The means for controlling 
this clutch is provided by the solenoid, which releases the collet by 
acting on the push rod in opposition to the spring. A 4-lb force 
is required to release the clutch and the necessary travel is 0.005 
in. The solenoid is constructed very much like a telephone re- 
ceiver. It has a circular armature, which is pulled down toward 
the center post when the coil is energized. The return spring for 
the armature is provided by the thin diaphragm attached to the 
armature at its center. The clutch can be disengaged by ap- 
proximately 2.5 watt in the solenoid coil. The operating time for 
the clutch is less than 10 millisec. Adjustments are provided for 
setting the maximum slip torque of the clutch, and the free play 
between the solenoid armature and the push rod. The total 
armature travel is established by grinding the center post during 
manufacture. 

Because the damper is sensitive to fluid viscosity, a thermo- 
statically controlled heater tube is included in the assembly to 
keep the temperature of the fluid in the damping gap at 160 F. 
The heater is required for airborne use, but may be dispensed with 
for other applications. The entire assembly is 2*/, in. long X 2 in. 
in diameter and has been designed to operate with a Mark 8, 
400-cps servomotor. A view of the complete dual-mode package 
is shown in Fig. 2. 

A block diagram of a typical instrument servo utilizing the 
dual-mode package is shown in Fig. 3. This servo will be de- 
scribed further at the end of the paper. As shown by the heavy 
arrows, the normal servo loop includes the fine synchro-control 
transformer, the servoamplifier, the servomotor, and the gearing 
to the dials and the synchro. As is usual in two-speed synchro 
data-transmission systems, a fine-coarse data switch is provided 
to switch from the fine synchro to the coarse synchro during large 


473 


DIAPHRAGM SOLENOID HEATER DAMPER CLUTCH MOTOR 
/ 
INN | 
WZ 
\ 
ARMATURE COIL 
|_| 


Fic. 2. Moror, Damper, CtutcH AssEMBLY 


F COARSE 
DATA SWITCH 


CLUTCH 
AMPLIFIER 


TRANSACTIONS OF THE ASME 


cuit. In the relay circuit, shown in Fig. 4, the coarse synchro 
voltage is amplified and applied to a full-wave detector. The d-c 
voltage thus derived is used to control a triode which drives a high- 
speed relay capable of operating in less than 1 millisec. This relay 
in turn supplies a direct current (approximately 100 milliamp) to 
operate the solenoid. 

Although the relay circuit was found to operate satisfactorily in 
laboratory-life tests, problems in operating relays in severe en- 


Fic. 4 Reray Conrrou Circuit 


SERVO 
AMPLIFIER 


Fie. Brock Diagram or Duat-Mope Servo 


errors. Stabilization of the servo in the normal mode is provided 
by the damper connected to the servomotor, and the servoampli- 
fier provides only voltage amplification. 

In dual-mode operation, when the coarse error exceeds a certain 
value (which is usually not the same as the value for fine-coarse 
switching), the clutch is actuated so as to disengage the damper 
from the servomotor. The servomotor, freed of the inertia of the 
damper, can then rapidly accelerate and slew the system to the 
new synchronization point. As soon as the error is again reduced 
to the switching angle, the damper is again connected to the servo- 
motor, and acts as an inertia brake to bring the system rapidly 
into synchronization. Usually the overshoot is so small that the 
system does not again leave the linear range after once entering it. 
The reason that the value of coarse error for clutching may be 
different from the value for fine-coarse switching is that there is a 
separate criteria for setting each one. The clutching angle is 
based on dynamic considerations to be described in the next por- 
tion of the paper. The fine-coarse switching angle is determined 
by the gear ratio between the fine and coarse synchros.’ 

A number of control circuits to operate the solenoid from the 
coarse error have been designed and tested. The two most satis- 
factory are a relay-control circuit and a magnetic-amplifier cir- 

5 “How to Design Speed Switching Cireuits,"’ by Basil T. Barber, 
ee Se November, 1954, pp. 50-53; December, 1954, 
PP. 


vironments led to the design of the magnetic-amplifier circuit, 
shown in Fig. 5. The coarse synchro voltage is again detected 
and amplified, but the relay coil in the plate circuit of the control 
tube is replaced by the control winding of a magnetic amplifier. 
The solenoid coil is operated from an a-c source by connecting it 
in series with a silicon diode and the load winding of the magnetic 
amplifier. 

The solenoid is shunted with an additional diode to provide a 
flow path for the decaying current in the solenoid during the reset 
half cycle of the supply voltage. This diode prevents large back 
voltages from appearing across the solenoid and makes the 
solenoid operation much less noisy. The advantages of the mag- 
netic-amplifier circuit are primarily increased reliability in appli- 
cations where the use of a high-speed relay might give trouble, 
and the elimination of the d-c source for solenoid operation. 
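
The clutching rule described above for dual-mode operation reduces to a threshold test on the coarse error. The sketch below states that rule; the function names are hypothetical, and the 5-deg clutching angle in the example is an invented value, since the text says only that the angle is set from dynamic considerations.

```python
def damper_engaged(coarse_error_deg, clutch_angle_deg):
    """True when the damper should be coupled to the servomotor.

    The damper stays connected while the error magnitude is within the
    clutching angle and is disconnected for larger errors.
    """
    return abs(coarse_error_deg) <= clutch_angle_deg


def solenoid_energized(coarse_error_deg, clutch_angle_deg):
    """The spring keeps the clutch engaged; the solenoid releases it."""
    return not damper_engaged(coarse_error_deg, clutch_angle_deg)


if __name__ == "__main__":
    CLUTCH_ANGLE_DEG = 5.0  # hypothetical clutching angle
    for error in (53.0, 10.0, 4.0, -0.5):
        print(error, damper_engaged(error, CLUTCH_ANGLE_DEG))
```

Keeping the solenoid de-energized in the normal mode matches the spring-engaged clutch described earlier, so the servo remains damper-stabilized if the control circuit loses power.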

Performance. As has been stated, the transient performance
of a damper-stabilized servo for large signals, such as those
encountered in synchronizing to the new position, is sometimes
undesirable. A typical large-signal transient for single-mode
operation of the system⁶ of Fig. 3 is shown in the upper portion of
Fig. 6. It will be noted that the system has an 80 per cent initial
overshoot and requires at least 6 complete cycles to reach and
stay within the linear range of operation.


6 The system, which was used as an experimental model for this
development, is described in detail in the Appendix.


Fig. 5 Magnetic-Amplifier Circuit

With the dual-mode system in operation, the transient shown 
in the lower part of Fig. 6 is obtained. In this mode the system 
initially reaches the desired position more rapidly because of the 
reduced system inertia with the damper disconnected. Recon- 
nection of the damper just prior to crossing the desired position 
results in an exceedingly high rate of deceleration which prevents 
the system from overshooting more than a fraction of 1 per cent. 
Once within the linear range, the servo completes synchronization 
rapidly because of the wide bandwidth of the normal servo loop. The
system completely settles in less time than is taken to reach the 
first overshoot in single-mode operation. 


DESIGN CONSIDERATIONS


Before delineating the design criteria, it is necessary to con- 
sider the type of input which the system is to follow. The input 
consists of a series of ramps of variable slope separated by large 
step changes. Straight damper compensation gives satisfactory 
performance during the ramp input. The second mode of opera- 
tion is added only to improve the system response to the large 
step changes. 

The dual-mode system offers great improvement in response to 
large steps as has been shown in Fig. 6. A second approach con- 
sists of the use of a slip clutch to connect the motor to the damper. 
These three systems, single-mode, dual-mode, and slip-clutch, are 
compared in Fig. 7 as to their time response to various size steps. 
The responses correspond to three step sizes representing linear 
operation, moderate saturation, and hard saturation of the error 
amplifier. The single-mode step responses show the degeneration 
in response with increasing step size. The improvement due to 
dual-mode operation is realized for large steps only, i.e., those 
sufficient in size to cause damper declutching. The slip-clutch 
system alters all step responses that demand a damper driving 
torque in excess of the slip-clutch limit. This effect is seen in the 
step response for steps as small as 0.02 deg. A marked improve- 
ment over both the single and dual-mode systems is noted in the 
step response for steps in the range of a few degrees in size. 

From Fig. 7 it appears that the slip-clutch system gives the best 
response over a wide range of input sizes. However, it was found 
experimentally, as shown in Fig. 8, that the response is affected 
greatly by the slip-clutch torque setting. For small values of slip 
torque the small-step response became more oscillatory as a result 
of reduced effective damping. For larger values of slip torque the 
large-step response became more oscillatory. The effectiveness of
the slip-clutch coupling hence depends on maintaining a
constant value of slip torque. This then becomes a stringent
requirement on the mechanical design of the slip clutch.


Fig. 6 Experimental Servo Response to 53-Deg Step


Fig. 7 Response of Experimental Servo to Various Size Steps
for Single-Mode, Dual-Mode, and Slip-Clutch Operation


Fig. 8 Effect of Change in Slip-Torque Setting on Response
to Step Inputs for Experimental Servo

The main factors affecting the design of the dual-mode system 
are: 

1 Solenoid pull-in and drop-out times. 

2 Type of signal; i.e., what function of the error is used to con- 
trol clutch actuation. 




3 Driving and drag torques of the clutch connecting the 
damper to the motor. 


The objectives behind disconnecting the damper during the 
large-error state are to allow the motor to accelerate most rapidly 
to slew velocity and to minimize the kinetic energy stored in the 
damper during slew. To take full advantage of the clutched mode 
the solenoid pull-in and drop-out times must be as small as possi- 
ble. (The action of the solenoid dropping out causes the clutch to 
engage.) Long pull-in time results in the damper acquiring ap- 
preciable rotational energy which must be absorbed during the 
final settling transient. Long drop-out time effectively means 
that the solenoid must be deactivated while the error is still large, 
thus requiring a wide zone of single-mode operation if clutch 
actuation depends on the magnitude of the error alone. 

With the damper disconnected, the system can accelerate to full 
slew velocity within 3 per cent of the total travel of the system. 
Since the probability distribution of the amplitude of step inputs 
is approximately uniform over the range 0 to 100 per cent of total 
system travel, the system will reach full slew velocity for about 94 
per cent of the step inputs. Therefore the magnitude of error can 
be used as the sole means to actuate the solenoid. The clutch re- 
engagement time can be compensated for by increasing the 
switching angle⁷ by an amount equal to the maximum output
velocity times the drop-out time of the solenoid. The addition of 
error-rate information to control the solenoid would be effective 
in reducing settling time for less than 5 per cent of the step inputs. 
The added complexity of adding an error-rate signal seems un- 
warranted in the system studied. 
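The compensation rule described above (switching angle equals clutching angle plus slew velocity times solenoid drop-out time) can be sketched numerically. This is only an illustrative sketch: the function name and argument names are hypothetical, and the numbers used are the ones the paper reports for the experimental servo later in this section (2.1-deg optimum clutching angle, 320 deg per sec maximum output rate, 10-millisec drop-out time).

```python
def switching_angle_deg(clutching_angle_deg, slew_rate_deg_per_s, dropout_time_s):
    """Error angle at which the solenoid should be de-energized.

    The output keeps moving at slew velocity while the solenoid drops out,
    so the solenoid must be released early by (slew rate x drop-out time).
    """
    delay_angle_deg = slew_rate_deg_per_s * dropout_time_s
    return clutching_angle_deg + delay_angle_deg


# Values quoted in the paper for the experimental servo
angle = switching_angle_deg(clutching_angle_deg=2.1,
                            slew_rate_deg_per_s=320.0,
                            dropout_time_s=0.010)
print(f"switching angle = {angle:.1f} deg")
```

With these inputs the delay angle is 3.2 deg, which added to the 2.1-deg clutching angle reproduces the 5.3-deg switching angle stated in the text.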

The determination of the optimum clutching angle was done 
experimentally with the results shown in Fig. 9. Minimum ampli- 
tude of the first overshoot is used as the criterion for this deter- 
mination. The parameter for the set of curves is the maximum 
torque that the clutch can transmit in the engaged state. Increas- 
ing the maximum value of transmitted torque results primarily 
in a reduction of overshoot amplitude with a small reduction in 
optimum clutching angle. 

Fig. 10 shows the final transients after clutching for various 
clutching angles. The clutch torque slip level of 5.8 in. oz is 2.4 
times the maximum stall torque of the drive motor. It is interest-
ing to note that the change in clutching angle affects only the
amplitude of overshoot, with the final transient settling time being 
about the same for all cases. The results of the analog-computer 
study corroborated the data shown in Fig. 9, which indicates
that there is a well-defined minimum. Also studied on the analog
computer was the effect of motor velocity at the time of clutching
on the optimum clutching angle. These results, which are per-
tinent only for those cases where the motor has not reached slew
velocity, are shown in Fig. 11. The computer results also indicate
that settling time is approximately a minimum for the clutching
angle that gives minimum overshoot.


⁷ The switching angle is defined as the error angle at which the
solenoid is de-energized. The clutching angle is defined as the error
angle at which the clutch engages. The two angles are different be-
cause a finite time is required for the solenoid force to build up or
decay.


Fig. 9 Effect of Variation in Clutch Torque on Optimum
Clutching Angle and Overshoot for a 53-Deg Step With
Experimental Dual-Mode Servo


Fig. 10 Final Transients After Clutching for Experimental
Dual-Mode Servo for Various Clutching Angles; 53-Deg
Step Input

For a fixed clutch slip torque of 5.8 in. oz, the optimum clutch-
ing angle was determined (see Fig. 10) to be 2.1 deg. To this
must be added the solenoid delay angle, equal to the maximum
output rate (320 deg per sec) times the drop-out time of the
solenoid (10 millisec), giving a total switching angle of 5.3 deg.
The switching angle then represents the minimum error which
will cause the system to switch to the high acceleration mode.
With this setting, the system was subjected to step sizes ranging
from 5.3 to 160 deg. The resultant reduction in settling time of
the dual-mode system over the single-mode system is shown in
Fig. 12. The analog-computer results show a higher factor of
settling time reduction due primarily to an assumed ideal clutch
characteristic.

As has been shown, the maximum torque transmission of the
clutch affects the amount of overshoot to some degree. Further
test results show that little is gained if the maximum clutch
torque is raised above approximately four times the rated
stall torque of the drive motor. This relaxes somewhat the require-
ments on the mechanical-clutch design. It is necessary for the
clutch to exhibit very low drag torque in the disengaged state to
prevent the damper from acquiring an appreciable velocity during
slew. The overshoot magnitude and settling time increase meas-
urably with increasing drag torque as observed in Fig. 13. If
drag torque for a fixed clutch design is excessive, a brake may be
used to stop the damper during slew.


Fig. 11 Effect of Variation in Motor Velocity at Clutching Upon Optimum
Clutching Angle and Overshoot Magnitude


Fig. 12 Settling Time Reduction for Dual-Mode Operation


Fig. 13 Effect of Damper Velocity on Final Transient


APPLICATIONS


Good response to both large and small step inputs, along with
the fast settling times provided by dual-mode operation, make the
solenoid-actuated clutch attractive in spite of its added com-
plexity to the system. It is ideally suited for improving the syn-
chronizing performance in a damper-stabilized synchro data re-
peater.⁸ Fig. 14 shows a data repeater with the dual-mode pack-
age as shown in Figs. 1 and 2 installed.

This instrument has a gain-crossover frequency of 40 cycles per
sec (cps), a velocity constant⁹ in excess of 10,000 sec⁻¹, a static
accuracy of 0.02 deg, and a maximum slew velocity of 320 deg per
sec. The repeater can follow smoothly at speeds as low as 0.01
deg per sec, giving a total speed range greater than 30,000 to 1.
By adding the dual-mode feature, the settling time for a 180-deg
step input is reduced from 5 to 1.1 sec, hence keeping total syn-
chronizing time under 1.1 sec for any size step. The clutch mecha-
nism showed negligible signs of wear and no deterioration in re-
sponse characteristic after responding to 10,000 cycles of large-
step operation.


⁸ “Better Synchro Repeaters From Damper-Stabilized Feedback,”
by John E. Ward, Control Engineering, vol. 2, July, 1955, pp. 90-91.

⁹ Velocity constant is defined as the ratio of the magnitude of a
constant input velocity to the steady-state following error resulting
from the constant velocity input.


Fig. 14 Synchro Data Repeater


ACKNOWLEDGMENT 


The research reported in this paper was made possible through 
the support extended the Massachusetts Institute of Technology, 
Servomechanisms Laboratory, by the United States Air Force 
(Armament Laboratory, Wright Air Development Center), under 
Contract No. AF33(616)-2038, and by the joint support of the 
Department of the Army, the Department of the Navy, and the 
Department of the Air Force under Air Force Contract No. AF- 
19(122)-458 with Lincoln Laboratory, M.I.T., and executed 
under subcontract by the Servomechanisms Laboratory, M.I.T. 
It is published for technical information only and does not repre- 
sent recommendations or conclusions of the sponsoring agencies. 


Appendix
NOMENCLATURE
The following nomenclature is used in the Appendix:


$a_d$ = average early acceleration with damper, rad sec⁻²

$a_m$ = average early acceleration without damper, rad sec⁻²

$f_d$ = damping constant of damper, in-oz sec rad⁻¹

$f_m$ = internal damping of motor, in-oz sec rad⁻¹

$n$ = gear ratio from motor shaft to output

$J_d$ = inertia of damper slug, in-oz sec² rad⁻¹

$J_m$ = inertia of motor rotor, gears, and load, in-oz sec² rad⁻¹

$J_s$ = inertia of damper shell, in-oz sec² rad⁻¹

gear ratio⁻¹ from output to fine synchro

synchro output constant, volts deg⁻¹

amplifier gain

conversion factor from radians to degrees, 180/π

system torque constant, output torque per deg error, in-oz
deg⁻¹

$K_T$ = motor torque constant, in-oz volt⁻¹

$S$ = Laplace operator

$T$ = internal motor torque, in-oz

maximum slip torque of clutch, in-oz

$T_s$ = stall torque of motor at rated control voltage, in-oz

$V$ = applied motor voltage, volts

$\alpha$ = ratio of inertias defined by Equation [8]

$\beta$ = ratio of inertias defined by Equation [14]

error, deg

input position, deg

$\theta_m$ = motor angular position, radians

output position, deg

major velocity build-up time constant with damper, sec

velocity build-up time constant without damper, sec

$\Omega_d$ = damper angular velocity, rad sec⁻¹

$\Omega_m$ = motor angular velocity, rad sec⁻¹

$\Omega_s$ = maximum velocity of motor, rad sec⁻¹


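TRANSFER FUNCTION FOR DAMPER-STABILIZED SYSTEM


From Fig. 15, using terminology previously given, the trans-
fer function for the damper-stabilized system is derived as follows:

$$T = (J_m + J_s)\,S\,\Omega_m + f_d(\Omega_m - \Omega_d) + f_m\,\Omega_m$$

$$f_d(\Omega_m - \Omega_d) = J_d\,S\,\Omega_d$$

Eliminating $\Omega_d$ gives

$$\frac{\Omega_m}{T} = \frac{S + \omega_d}{(J_m + J_s)\left[S^2 + \left(\omega_d + \dfrac{f_d + f_m}{J_m + J_s}\right)S + \dfrac{f_m\,\omega_d}{J_m + J_s}\right]}$$

This can be written more simply in the approximate form

$$\frac{\Omega_m}{T} = \frac{S + \omega_d}{(J_m + J_s)\left(S + \dfrac{\omega_m}{\alpha}\right)(S + \alpha\omega_d)}$$

where

$$\omega_m = \frac{f_m}{J_m + J_s}, \qquad \omega_d = \frac{f_d}{J_d}, \qquad \alpha = \frac{J_m + J_s + J_d}{J_m + J_s} \qquad [8]$$

Further, with $T = K_T V$,

$$\frac{\theta_m}{V} = \frac{K_T\,(S + \omega_d)}{(J_m + J_s)\,S\left(S + \dfrac{\omega_m}{\alpha}\right)(S + \alpha\omega_d)} \qquad [10]$$

The system is then shown in block diagram in Fig. 16.

For the damper declutched, the equations become

$$T = K_T V$$

$$T = J_m\,S\,\Omega_m + f_m\,\Omega_m$$

$$\frac{\theta_m}{V} = \frac{K_T}{J_m\,S\,(S + \beta\omega_m)}$$

where

$$\beta = \frac{J_m + J_s}{J_m} \qquad [14]$$

For large step inputs the acceleration capabilities for the system
with and without the damper are compared as:
(a) For the damper in the system

$$a_d = \frac{T}{\alpha\beta J_m} \qquad [15]$$

(b) For the damper disconnected

$$a_m = \frac{T}{J_m} \qquad [17]$$

Hence for the damper disconnected the acceleration is greater
by the factor $\alpha\beta$ which, for the system considered, is about 25.


Fig. 15 Damper-Motor Schematic


Fig. 16 System Block Diagram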

For the system described in this paper the constants have the
following values:


Gear ratio from output to fine synchro = 27

Synchro output constant = 0.4 volt deg⁻¹

Conversion factor from radians to degrees = 57.3 deg rad⁻¹

$n$ = 108
$K_T$ = 0.016 in-oz volt⁻¹
$J_m$ = 0.72 × 10⁻⁴ in-oz sec² rad⁻¹
$J_s$ = 1.41 × 10⁻⁴ in-oz sec² rad⁻¹
$J_d$ = 1.69 × 10⁻³ in-oz sec² rad⁻¹
$f_m$ = 1.23 × 10⁻³ in-oz sec rad⁻¹
$f_d$ = 0.156 in-oz sec rad⁻¹

$T_s$ = 2.4 in-oz

$\Omega_s$ = 600 rad sec⁻¹


Using these values, the transfer function of Equation [10] is
found to be

$$\frac{\theta_m}{V} = \frac{75\,(S + 92.3)}{S\,(S + 0.65)(S + 825)} \qquad [19]$$

= 2.15 X 10’ volt rad~'. .[20} 
n 


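Major velocity build-up time constant with damper = 1.5 sec
Velocity build-up time constant without damper = 0.0585 sec
System torque constant = 7000 in-oz deg⁻¹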

Discussion 


G. A. Biernson.¹⁰ Damper compensation is ideally suited for
many instrument-servo applications because of the high stiffness 
and wide bandwidth that can be achieved with rather simple 
electronics. The dual-mode technique described in this paper 
provides an excellent solution to the poor synchronizing response 
which is the main limitation of damper compensation in such 
applications. 


¹⁰ Advanced Research Engineer, Sylvania Electric Products Inc.,
Electronic Systems Division, Waltham, Mass. 




It should be pointed out, however, that when there is significant 
load inertia, damper compensation is generally inferior to ta- 
chometer compensation, because of the following: 


(a) The sustained acceleration capability of the motor is much 
less when coupled to a damper, because the motor must ac- 
celerate the damper slug. 

(b) For good stability the allowable reflected load inertia with 
respect to the motor must be quite small in comparison to the 
damper-slug inertia. 


Consequently, the acceleration capability of the output is quite 
restricted with damper compensation if there is much load inertia. 

The acceleration capability of a damper-stabilized motor is 
equal to the motor torque divided by the total inertia coupled to 
the motor shaft which includes the inertia of the floating slug 
and the direct-coupled inertia. When full motor voltage is first 
applied the damper-stabilized motor can achieve instantaneously 
a much faster acceleration because it need only accelerate the
direct-coupled inertia. However, if it is to sustain an acceleration 
for a reasonable period of time, it also must accelerate the damper 
slug, which is usually at least six times as large as the direct- 
coupled inertia. 

The direct-coupled inertia includes the motor-shaft inertia, the 
reflected load and gear inertia, and the damper-shell inertia. 
The damper-shell inertia, unfortunately, is from 5 to 10 per cent 
of the slug inertia and hence may account for as high as 60 per 
cent of the total direct-coupled inertia. Consequently the allowa- 
ble value of the reflected load inertia is quite small in comparison 
to the total inertia coupled to the motor shaft. 

The dynamic error of a damper servo in response to a low-fre- 
quency input is determined primarily by the acceleration-error 
coefficient. The value of this coefficient for the servo described in 
this paper unfortunately has not been mentioned. 


Rufus Oldenburger.¹¹ The use of clutches as described here to
modify the transfer function to improve performance is particu- 
larly effective for small servos, especially of the military variety 
where clutch wear may not be too big a problem. From our stud- 
ies we have concluded that when it comes to large physical de- 
vices, such as governed engines, inertia-damping stabilization is 
out of the picture because of bulk, cost, and other factors. 
Further, our experience is that, when the clutch is slipping, the 
force that exists depends on a number of unknowns and cannot
normally be taken into account in accurate analytical studies. If 
the clutching and declutching are done quickly enough this may 
not cause trouble. 


AUTHORS' CLOSURE


We appreciate the time Mr. Biernson spent in preparing his 
comments although they are not considered to be appropriate. 
The purpose of the paper was to describe a technique for improv- 
ing the response of a particular system and not to discuss the 
relative merits of various compensation schemes. 

Also we do not consider it particularly unfortunate that the 
acceleration-error coefficient was not mentioned.


¹¹ Director of Research, Woodward Governor Company, Rockford,
Ill. Mem. ASME. 




Experiments With Optimalizing Controls 
Applied to Rapid Control of Engine 
Pressures With High-Amplitude 


Noise Signals 


By GEORGE VASU,¹ CLEVELAND, OHIO


Optimalizer control principles were applied to the con- 
trol of a flight-propulsion system. The control system 
described metered the fuel flow to an engine in such a 
manner as to cause the engine to seek a maximum pres- 
sure. Experimental data are presented illustrating the 
control behavior for a range of flight conditions, for 
various control settings such as gain and integral time 
constant, for various amounts of filtering, and so on. 


NOMENCLATURE 
The following nomenclature is used in the paper: 


$A_0$ = amplitude of optimalizer test signal

$A_1$ = amplitude of controlled pressure signal resulting from test
signal

$A_n$ = amplitude of a particular noise-frequency component

$K$ = gain of proportional part of control

$N(t)$ = general function denoting noise

$P$ = controlled pressure (engine-compressor output pressure),
psfa

$t$ = time, sec

$V_i$ = test-signal voltage, volts

$V_{mf}$ = filtered-multiplier output voltage, volts

$V_{mu}$ = unfiltered-multiplier output voltage, volts

$V_p$ = amplified voltage proportional to controlled pressure,
volts

$V_{pf}$ = amplified and filtered signal at output of bandpass filter,
volts

$w_f$ = fuel flow, lb/hr

$\theta$ = phase angle between signals, radians

$\tau$ = time constant of integral control, sec

$\omega$ = frequency, radians/sec (rps)

$\omega_n$ = frequency of a particular noise component, radians/sec
(rps)


INTRODUCTION 


In recent years, automation or automatic control has done much
to stimulate progress. Of the many types of automatic controls
possible, that class of controls employing optimalizing principles²
is certain to become of increasing importance in the coming years
as people become more familiar with such systems.


¹ Assistant Head, Section A, Controls Branch, Lewis Flight Propul-
sion Laboratory, National Advisory Committee for Aeronautics.

² “Principles of Optimalizing Control Systems and an Application
to the Internal Combustion Engine,” by C. S. Draper and Y. T. Li,
Aeronautical Engineering Department, Massachusetts Institute of
Technology, published by ASME, September, 1951.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at the
ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
31, 1956. Paper No. 56-IRD-14.

In the field of automatic controls for flight-propulsion systems 
there are numerous situations in which optimalizing principles
can be applied. One of these situations is considered in this paper. 
Specifically, the paper presents a discussion of an optimalizing 
control which varies the input fuel flow to a flight-propulsion sys- 
tem in such a manner as to produce the maximum output pres- 
sure automatically. The paper first describes the type of optimal- 
izer control used, with a discussion of its principle of operation, 
including the effect of component dynamics and noise on control 
operation. Next, the basic behavior of the system as determined 
experimentally is discussed; and finally, the effect on performance 
of filtering, test-signal amplitude, and so forth, is treated. 


THEORY OF CONTROL


Principle of Operation. For the application under discussion, 
the region of desirable engine operation can be related to the re- 
gion near peaks in pressure fuel-flow characteristics such as shown 
in Fig. 1. The existence of such peaks suggests the possibility of 
utilizing optimalizer control principles² to insure operation in the
desirable region throughout a range of flight conditions. 

The particular type of optimalizer control chosen was the con- 
tinuous test-signal type. A block diagram of the system is shown 
in Fig. 2. A sinusoidal test signal ($A_0 \sin \omega t$) was introduced at
the fuel servo input as illustrated. This test signal caused periodic
variations in fuel flow and engine pressures. A particular pressure
indicative of engine output was sensed and amplified. The result-
ing signal was then filtered with a bandpass filter to remove any
noise present in the signal, to remove the steady-state or d-c
pressure level, and to pass primarily the fundamental test-signal
frequency.


Fig. 1 Engine-Compressor Pressure Ratio-Fuel Flow Static
Characteristics


Fig. 2 Block Diagram of Optimalizer Control

The output of the bandpass filter consisted, then, essentially of
a signal of test-signal frequency with an amplitude determined by
the gains of the servo, engine, sensor, amplifier, and filter, plus
some phase shift due to the dynamics of these same components.
The signal at the multiplier ($V_{pf}$) thus can be represented by
$A_1 \sin(\omega t + \theta)$.

The test signal is also introduced into the multiplier. The out-
put of the multiplier is the product of the two inputs giving

$$V_i V_{pf} = [A_0 \sin \omega t]\,[A_1 \sin(\omega t + \theta)] \qquad [1]$$

or

$$V_i V_{pf} = A_0 A_1 \sin \omega t \,\sin(\omega t + \theta) \qquad [2]$$

Utilizing the trigonometric formula

$$\sin\alpha \,\sin\beta = \tfrac{1}{2}\,[\cos(\alpha - \beta) - \cos(\alpha + \beta)] \qquad [3]$$

gives

$$A_0 A_1 \sin \omega t \,\sin(\omega t + \theta) = \frac{A_0 A_1}{2}\,[\cos(-\theta) - \cos(2\omega t + \theta)] \qquad [4]$$

or

$$V_{mu} = \frac{A_0 A_1}{2}\,[\cos(-\theta) - \cos(2\omega t + \theta)] \qquad [5]$$


Thus the signal out of the multiplier consists of a d-c component 
and a second harmonic of the test-signal frequency. 

The purpose of the low-pass filter following the multiplier is to 
pass the d-c component and to filter out the second harmonic. 
The control therefore receives essentially the d-c component. 
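The demodulation step in Equations [1] to [5] can be checked numerically: multiplying the test signal by a phase-shifted copy of itself and averaging over whole periods (an idealized stand-in for the low-pass filter) should leave only the d-c term. The following is a minimal sketch; the function name, the default frequency, and the sample counts are illustrative assumptions, not part of the original apparatus.

```python
import math


def multiplier_dc(a0, a1, theta, omega=2.0 * math.pi,
                  periods=10, samples_per_period=1000):
    """Average of [a0 sin(wt)] * [a1 sin(wt + theta)] over whole periods.

    Averaging over an integer number of test-signal periods removes the
    second-harmonic term, which is what the low-pass filter approximates.
    """
    n = periods * samples_per_period
    dt = (2.0 * math.pi / omega) / samples_per_period
    total = 0.0
    for k in range(n):
        t = k * dt
        total += a0 * math.sin(omega * t) * a1 * math.sin(omega * t + theta)
    return total / n


# Left of the peak (theta = 0) the d-c term is positive; right of it
# (theta = 180 deg) the same magnitude appears with the sign reversed.
left = multiplier_dc(1.41, 0.8, 0.0)
right = multiplier_dc(1.41, 0.8, math.pi)
print(left, right)
```

The averaged product matches `(a0 * a1 / 2) * cos(theta)`, so its sign alone tells the control which side of the pressure peak the engine is operating on.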

The magnitude of the d-c component is a function of the ampli-
tudes of the two multiplier input signals and is also a function of
the phase shift between them. The $(A_0 A_1/2)\cos(-\theta)$ term of
Equation [5] consists then of $A_0$ which is the amplitude of the test
signal that was introduced directly into the multiplier; $A_1$ which
is the test-signal amplitude times the gain of the servo, engine,
sensor, amplifier, and bandpass filter; $\cos(-\theta)$ which is the co-
sine of the phase angle between the two signals at the multiplier
input; and a factor of 1/2.

The magnitude of the signal to the control is therefore $(A_0 A_1/2)
\cos(-\theta)$ and the control action will depend on the factors which
make up this d-c component.

In order to establish proper directions to the fuel-flow change
such that the peak will be sought, consider the algebraic sign of
the d-c component $(A_0 A_1/2)\cos(-\theta)$. $A_0$ and $A_1$ are amplitudes
of periodic signals and can be thought of as having positive signs.
The phase angle $\theta$ will depend on dynamics of components and
upon the algebraic sign or the slope of the static characteristics of
each component.

For simplicity, consider that the algebraic sign of each com-
ponent except the engine is positive. In the engine the algebraic
sign or slope of the static characteristic is positive to the left of
the peaks and negative to the right. In other words, to the left
of the peaks (neglecting dynamics) the phase angle $\theta$ is zero and to
the right it is 180 deg. Cos $(-\theta)$ then will be positive on the left
and negative on the right and as a result the whole term $(A_0 A_1/2)
\cos(-\theta)$ (neglecting dynamics) will be positive when the engine
is operating to the left of the peaks and negative to the right.
When operating on the left, the control then receives a positive
signal which is fed to the fuel servo to increase fuel flow. When
operating on the right of the peaks, the signal is reversed, causing
the fuel flow to decrease. In either case, the control will seek the
peaks in the characteristics and the system will operate in the de-
sirable region.

Effect of Component Dynamics. Thus far, component dynamics 
have been neglected. The effect of dynamics is to introduce 
phase shift in the signals traveling through the system and also to 
attenuate or amplify the various frequency components, so that 
usually the gain for a particular frequency is different from the zero- 
frequency gain associated with the static characteristics. 

The effect of dynamics on the control action resulting from the
$(A_0 A_1/2)\cos(-\theta)$ term of Equation [5] can be defined as follows:
(1) Any attenuation or amplification of the test signal frequency
as it passes through the components from the servo to the
multiplier will affect $A_1$, and $A_1$ should be determined by the gains
of components at the test-signal frequency if there are appreciable
dynamics in these components. (2) The output of the multiplier
will be affected as a function of $\cos(-\theta)$ and the signal to the
control will vary accordingly.

Consider the variation of $\cos(-\theta)$ as a function of $\theta$. For
$0 < \theta < 90$ deg, $\cos(-\theta)$ decreases from 1 to 0. For $90 < \theta <
180$ deg, $\cos(-\theta)$ varies from 0 to −1. Therefore phase shifts of
up to 90 deg cause an attenuation of the signal to the control, but
polarity is still suitable for satisfactory behavior. Phase shifts of
more than 90 deg, however, cause a change in sign to the control
which would cause divergent or runaway control action.

A method of eliminating the problems arising from phase shift 
caused by component dynamics is to insert an equal amount of 
phase shift into the test signal before it enters the multiplier. The 
two signals entering the multiplier would then be in proper phase 
relation. 
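The phase-compensation idea can be illustrated with the closed-form d-c term. If the reference test signal entering the multiplier is itself shifted by some angle, the d-c output depends on the difference between the loop phase shift and the reference phase shift. The sketch below assumes that relation (it follows from the same product-to-sum identity as Equation [3]); the function and argument names are hypothetical.

```python
import math


def dc_signal(a0, a1, loop_phase_rad, reference_phase_rad=0.0):
    """D-c multiplier output when the reference test signal is phase shifted.

    Product of a0 sin(wt + reference_phase) and a1 sin(wt + loop_phase),
    averaged over a period: (a0 a1 / 2) cos(loop_phase - reference_phase).
    """
    return 0.5 * a0 * a1 * math.cos(loop_phase_rad - reference_phase_rad)


loop_phase = math.radians(120.0)   # more than 90 deg of component phase shift

uncompensated = dc_signal(1.0, 1.0, loop_phase)
compensated = dc_signal(1.0, 1.0, loop_phase, reference_phase_rad=loop_phase)
print(uncompensated, compensated)
```

Without compensation the 120-deg shift reverses the sign of the control signal, which is the runaway case described above; with an equal shift inserted in the reference, the full-amplitude positive signal is recovered.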

Effect of Noise. With noise present in the system, the signal to
the multiplier from the bandpass filter can be considered as $A_1 \sin
(\omega t + \theta) + N(t)$. The effect of the first term $A_1 \sin(\omega t + \theta)$
which is the test signal frequency component has been discussed.
The noise $N(t)$, in general, will be made up of a large number of
frequency components. Consider a typical frequency component
$A_n \sin \omega_n t$. The contribution to the multiplier output due to this
particular component of noise will be

$$A_0 A_n \sin \omega t \,\sin \omega_n t = \frac{A_0 A_n}{2}\,[\cos(\omega t - \omega_n t) - \cos(\omega t + \omega_n t)] \qquad [6]$$


These are the familiar sum and difference frequencies arising in a 
modulation or multiplication process. No d-c component is pro- 
duced unless a noise component exists with exactly the same fre- 
quency as the test signal. In many cases such an occurrence is 
unlikely. When such an occurrence is likely, proper attention 
must be given to the choice of the test-signal frequency, to filter- 
ing action, and to the amplitude of the test signal used. 

The sum and difference frequency components in the multiplier 
output would generally be of small magnitude as a result of the 
filtering action of the bandpass filter on the noise in the controlled 
pressure signal. In any case, whether they are large or small, they 
would contribute to control action and the result would be a con- 
trol response with these superimposed frequency components. 
As long as the noise frequency is different from the test-signal 
frequency, the product will be an a-c signal which will produce no 
change in the average value of fuel flow. If the magnitude of the 
a-c components is objectionable, the solution is to filter out those 
frequencies. 
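The noise-rejection argument can be checked with a short numerical average. The sketch below multiplies a unit test signal by a unit noise component and averages over an interval containing whole periods of both the sum and difference frequencies. The frequencies (2 and 3 cps), the 1-sec interval, and the function name are arbitrary illustrative choices, not values from the experiment.

```python
import math


def average_product(freq_test_cps, freq_noise_cps, duration_s, samples=10_000):
    """Time average of sin(2 pi f t) * sin(2 pi f_n t) over duration_s."""
    total = 0.0
    for k in range(samples):
        t = duration_s * k / samples
        total += (math.sin(2.0 * math.pi * freq_test_cps * t)
                  * math.sin(2.0 * math.pi * freq_noise_cps * t))
    return total / samples


# Noise at a different frequency: only sum and difference terms, no d-c.
different = average_product(2.0, 3.0, duration_s=1.0)

# Noise at exactly the test-signal frequency (in phase): a d-c term of 1/2.
same = average_product(2.0, 2.0, duration_s=1.0)
print(different, same)
```

Only the equal-frequency case leaves a nonzero average, which is why a noise component at or very near the test-signal frequency is the one situation singled out as troublesome.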

It appears, then, that the multiplier does an excellent job of dis-
criminating between noise and the test signal. There is a special
situation which may cause some difficulty, however. If a strong 
noise component exists at a frequency close to that of the test 
signal, the difference frequency component produced in the multi- 
plication process would be a very low-frequency signal, causing the 
fuel flow to vary at this frequency. The test-signal frequency 
should be chosen so that there is no single noise component with 
a large amplitude and a frequency near that of the test signal. 
Large amplitudes of noise at frequencies farther from the test- 
signal frequency can, of course, be filtered out as previously 
mentioned. 


DESCRIPTION OF EXPERIMENTAL SYSTEM 


Pressure Sensor. The engine pressure being controlled was 
sensed by a variable-inductance type of pressure transducer. 
The tubing connecting the transducer to the pressure probe in the 
engine was made as short as possible to reduce dynamic effects. 
The frequency response of the sensor with tubing was approxi-
mately equivalent to that of a second-order system with a damp-
ing ratio of 0.5 and a natural frequency of 130 cycles per second
(cps). At 20 cps there was less than 5 per cent rise in amplitude
ratio and less than 10 deg phase shift. At frequencies below 20
cps, which is the range of most significance in the control system,

Carrier Amplifier. The output of the sensor was amplified by 
a carrier-type amplifier designed for frequencies from zero to 300 
cps; therefore, for frequencies below 20 cps, amplitude and phase 
errors were again very small. 

The gain of the sensor and carrier amplifier was 1.08 × 10⁻³
volts per psf. 

Bandpass Filter. The bandpass filter used in the investigation
was a commercial type with adjustable upper and lower cutoff
frequencies. The gain in the pass band was approximately unity.
On each side of the pass band the attenuation was 24 decibels per
octave. The transfer function of the filter was approximately

$$\frac{\omega^4 \tau_1^4}{(1 + 2jA\omega\tau_1 - \omega^2\tau_1^2)^2\,(1 + 2jA\omega\tau_2 - \omega^2\tau_2^2)^2}$$

where the low cutoff frequency is $1/(2\pi\tau_1)$, the high cutoff fre-
quency is $1/(2\pi\tau_2)$, and the factor $A$ is slightly greater than 0.6.

For all of the data presented except Figs. 3 and 4 and where 
specifically indicated, the low cutoff frequency was 1 cps and the 
high cutoff frequency was 4 cps. 

For Figs. 3 and 4 the low cutoff frequency was 0.5 cps and the 
high cutoff frequency was 8 cps. 

D-C Amplifiers. An electronic analog computer was used for 
that part of the control system from the bandpass-filter output to 
the fuel servo input. The output of the bandpass filter was ampli- 
fied by a d-c amplifier in the computer before the multiplication 
was performed. The gain of this amplifier was 500. 

The test signal $V_i$, generated by a low-frequency function gen-
erator, was also amplified before being fed to the multiplier. The
gain of this amplifier was 10. 

For both d-c amplifiers the bandwidth was sufficient to permit 
dynamic errors below 20 cps to be neglected. In the block dia- 
gram of Fig. 2 these amplifiers have been omitted for simplicity. 

Multiplier. The multiplier was a pulse-width, pulse-height 
modulation type of multiplier with a bandwidth of approximately 
1000 cps and therefore produced negligible amplitude and phase 
error at pertinent control frequencies. The multiplier accepts 
two inputs x and y and produces a product z = xy/100. 

Low-Pass Filter. The low-pass filter consisted of a simple lag 
with the transfer function 


$$
\frac{K_3}{1 + \tau_3 p}
$$


where the gain K₃ was unity and the time constant τ₃ was 0.1 sec, 
which gives a break frequency of approximately 1.6 cps. 


Fig. 3 Control Operating Line on Engine Pressure-Fuel Flow Characteristics 
(Test-signal amplitude, A₀ = 1.41 volts; control gain, K = 0.05; integrator 
time constant, τ = 0.05 sec.) 


Fig. 4 Control Operating Line on Engine Pressure Ratio-Temperature Ratio Characteristics 
(Test-signal amplitude, A₀ = 1.41 volts; control gain, K = 0.05; integrator 
time constant, τ = 0.05 sec.) 

Control. A proportional-plus-integral control was chosen for 
the investigation; thus additional flexibility over a simple pro- 
portional control or integral control was obtained. Also, ex- 
perience has indicated that this type of control is suitable for a 
wide variety of systems. For this particular system where the 
predominant dynamics are in the filters, a proportional-plus- 
integral control appears to be especially desirable. 

The transfer function of the control was 


$$
K\left(1 + \frac{1}{\tau p}\right)
$$


where the control gain K and the integrator time constant τ were 
variable so that the control performance could be evaluated for 
different control settings. 

During a portion of the investigation, the proportional part of 
the control was disconnected to give plain integral control. The 
transfer function in this case reduced to 


$$
\frac{K}{\tau p}
$$


where the effective control time constant is τ/K. 

All of the data presented herein are for proportional-plus- 
integral-control action except those in Fig. 8 which were obtained 
with integral control only. 
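
The two control modes can be illustrated with a minimal discrete-time sketch in Python. It assumes rectangular integration of the error at a fixed step; the function name, the step size, and the integral-only settings K = 0.05 and τ = 0.2 sec (chosen so that τ/K = 4 sec) are hypothetical and are not taken from the experiment.

```python
def pi_control_output(errors, dt, K=0.05, tau=0.05, integral_only=False):
    """Apply K*(1 + 1/(tau*p)), or K/(tau*p) when integral_only is True,
    to a sampled error sequence and return the sampled control output."""
    integral = 0.0
    outputs = []
    for e in errors:
        integral += e * dt                      # rectangular integration of the error
        proportional = 0.0 if integral_only else e
        outputs.append(K * (proportional + integral / tau))
    return outputs


if __name__ == "__main__":
    unit_error = [1.0] * 400                    # 4 sec of constant unit error at dt = 0.01
    pi_out = pi_control_output(unit_error, dt=0.01)
    i_out = pi_control_output(unit_error, dt=0.01, K=0.05, tau=0.2, integral_only=True)
    print("proportional-plus-integral output after 4 sec:", pi_out[-1])
    print("integral-only output after 4 sec:", i_out[-1])
```

For integral control alone, a constant error produces a ramp whose output equals the error after τ/K seconds, which is the sense in which τ/K acts as the effective time constant.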

Summer. The output of the control was added to the test signal 
as shown in Fig. 2. The gain from the control output to the fuel 
servo input was unity, but the test signal was attenuated by a 
factor of 3, resulting in a gain of 1/3 for that portion of the summer. 

Fuel Servo. The fuel system consisted of an electrohydraulic 
servosystem which positioned a fuel valve in response to an input- 
voltage signal. The frequency response of fuel flow to fuel servo- 
input voltage for the range of frequencies up to 20 cps is roughly 
equivalent to that of a second-order system with a damping ratio 
of 0.5 and a natural frequency of 10 cps. The gain of the fuel 
servo is 278 lb per hr per volt. 
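
The gains quoted for the summer and fuel servo can be combined in a short Python calculation. The sketch converts a test-signal amplitude in volts to fuel flow and estimates the dynamic gain and phase of the servo, treating it as an ideal second-order system with the stated damping ratio and natural frequency (an idealization of "roughly equivalent"). The function and constant names are illustrative.

```python
import cmath
import math

SERVO_GAIN_LB_HR_PER_VOLT = 278.0     # fuel-servo gain quoted in the text
TEST_SIGNAL_SUMMER_GAIN = 1.0 / 3.0   # test signal is attenuated by a factor of 3
DAMPING_RATIO = 0.5
NATURAL_FREQ_CPS = 10.0


def fuel_flow_amplitude(test_signal_volts):
    """Fuel-flow amplitude (lb/hr) produced by a test-signal amplitude in volts."""
    return test_signal_volts * TEST_SIGNAL_SUMMER_GAIN * SERVO_GAIN_LB_HR_PER_VOLT


def servo_dynamic_response(f_cps):
    """Relative gain and phase (deg) of an ideal second-order servo at f_cps."""
    r = f_cps / NATURAL_FREQ_CPS
    h = 1.0 / complex(1.0 - r * r, 2.0 * DAMPING_RATIO * r)
    return abs(h), math.degrees(cmath.phase(h))


if __name__ == "__main__":
    for volts in (1.41, 2.83):
        print(volts, "volts ->", fuel_flow_amplitude(volts), "lb/hr")
    gain, phase = servo_dynamic_response(2.0)
    print("servo at 2 cps: relative gain", gain, "phase (deg)", phase)
```

Under this idealization the servo contributes only a modest phase lag at 2 cps, and the lag grows as the test-signal frequency approaches the 10-cps natural frequency.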

Engine. To provide favorable conditions for control operations, 
a test-signal frequency of 2 cps was selected. This choice 
of frequency was made for two reasons: (1) The phase shift in the 
engine was small at that frequency. (2) There were no peaks in 
the noise spectrum in that range of frequencies. 
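
The interaction of the components described above can be sketched as a greatly simplified simulation in Python. Everything specific in it is an assumption made for illustration: the engine is replaced by a hypothetical static parabola with its peak at `u_peak`, the bandpass filter by a first-order washout, the low-pass filter and fuel servo are omitted, and the control is integral only. It is a sketch of the continuous test-signal principle, not a model of the experimental system.

```python
import math


def run_optimalizer(u0=1.0, u_peak=2.0, f_test=2.0, amp=0.2,
                    k_int=5.0, f_washout=0.5, t_end=20.0, dt=0.001):
    """Simulate a test-signal optimalizer on a hypothetical static peak.

    Returns the mean control input reached at t_end.
    """
    def plant(u):
        # Hypothetical static characteristic with a single peak at u_peak.
        return -(u - u_peak) ** 2

    w = 2.0 * math.pi * f_test
    w_h = 2.0 * math.pi * f_washout
    u_mean = u0
    y_slow = plant(u0)                          # slowly varying part of the output
    t = 0.0
    for _ in range(round(t_end / dt)):
        test = math.sin(w * t)                  # unit-amplitude test signal
        y = plant(u_mean + amp * test)          # output with test signal superimposed
        y_slow += dt * w_h * (y - y_slow)       # first-order lag tracks the slow part
        y_ac = y - y_slow                       # washout: keeps the alternating part
        error = y_ac * test                     # multiplier (demodulator)
        u_mean += dt * k_int * error            # integral control action
        t += dt
    return u_mean


if __name__ == "__main__":
    print("final input starting left of the peak:", run_optimalizer(u0=1.0))
    print("final input starting right of the peak:", run_optimalizer(u0=3.0))
```

Because the average multiplier output has the sign of the local slope, the integrator drives the input toward the peak from either side; the residual ripple comes from the second harmonic that the omitted low-pass filter would otherwise attenuate.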


BEHAVIOR OF EXPERIMENTAL SYSTEM 


Steady-State Performance. The control system described was 
operated at various simulated flight conditions. The steady-state 
performance attained over a range of flight conditions is illustrated 
in Fig. 3. The figure shows engine operation in terms of compres- 
sor-pressure ratio as a function of adjusted fuel flow. For a par- 
ticular altitude and Mach number the ordinate can be considered 
as the engine output or controlled pressure. The data have been 
plotted as a function of adjusted fuel flow to eliminate, as much 
as possible, variations due to burner efficiency and variations in 
Mach number, altitude and inlet temperature from the intended 
values at the simulated flight condition. The adjustments were 
made by multiplying the actual value of fuel flow by burner ef- 
ficiency and also multiplying by generalization factors for tem- 
perature and pressure. These generalization factors involve 
ratios of actual temperature and pressure to the exact values of 
temperature and pressure that should exist at the particular flight 
condition. These fuel-flow adjustments permit a more accurate 
presentation of the control operating line on the static character- 
istics of the engine. 

The steady-state values of pressure and fuel flow obtained dur- 
ing control operation are indicated in the figure. The control 
gain K was 0.05; the integrator time constant τ was 0.05 sec; and 
the test-signal amplitude A₀ was 1.41 volts. The test-signal amplitude 
is indicated in the figure in terms of fuel flow to show the 
magnitude of cyclic perturbations. 

The data indicate that very nearly the peak pressure was at- 
tained at all Mach numbers. Even at the lowest Mach number, 
where the slope to the right of the peak is very small, the system 
operated near the peak. It appears that the control is causing the 
engine to operate slightly to the right of the peaks over the range 
of Mach numbers. The amount of deviation from the peak in 
most cases is quite small and is within the range of the experi- 
mental error involved in determining the static characteristics. 

To establish the control operating line more definitely and per- 
mit easier visualization, the same steady-state performance data 
shown in Fig. 3 are presented in Fig. 4 in terms of compressor- 
pressure ratio and engine-temperature ratio. Since ratios are in- 
volved on both ordinate and abscissa, the engine characteristics 
can be fixed more accurately. Also, the peaks of the engine char- 
acteristics fall at nearly the same temperature ratio, resulting in a 
nearly vertical control operating line. Again the data points ob- 
tained during control operation fall slightly to the right of the 
peaks. 

There are at least two possible explanations for the position of 
the control line. First, any static error caused, for example, by 
drift in certain components such as the multiplier, would result in 
operation off the peak, and could cause some of the discrepancies 
shown. The second factor which certainly should cause operation 
to the right of the peaks is the result of different slopes below and 
above the peaks. The steeper slope to the left would give a signal 
during the cyclical excursion to the left which is stronger than 
that obtained on the right side. As a result, the system should 
settle out slightly to the right of the peaks. 

Another factor affects the value of pressure obtained during 
control operation. For test-signal excursions around the peak, the 
cyclic pressure resulting would have a lower average value than 
the peak of the static characteristics. This hunting loss is quite 
small, however. 
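
The size of this hunting loss can be estimated with a short worked example, assuming (hypothetically) that the static characteristic is parabolic near its peak with curvature constant c, and that the fuel flow oscillates sinusoidally about the peak with amplitude a. The symbols P_max, w_pk, c, and a are introduced here for illustration only.

```latex
P(w_f) \approx P_{\max} - c\,(w_f - w_{pk})^2, \qquad
w_f(t) = w_{pk} + a \sin \omega t

\overline{P} = P_{\max} - c\,a^2\,\overline{\sin^2 \omega t}
             = P_{\max} - \tfrac{1}{2}\,c\,a^2
```

Under these assumptions the loss grows with the square of the test-signal amplitude, so halving the amplitude reduces the loss to one quarter.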

Transient Performance. The dynamic performance of the sys- 
tem was studied by subjecting the control system to step dis- 
turbances in fuel servo input voltage. A typical response of the 
control system to such a disturbance is shown in Fig. 5. The 
test-signal amplitude A₀ was 2.83 volts (262 lb/hr in fuel flow); 
the control gain K was 0.05; and the integrator time constant τ 
was 0.2 sec. The control was operating in steady state when, at 
point A, the step disturbance was introduced causing the servo 
input voltage to change suddenly to a new value (point B). As a 
result of the step change in servo input voltage the fuel-valve 
position and fuel flow also change rapidly to new values. Notice 
that just before the transient, there is a gap in the fuel-flow trace. 


Fig. 5 Response of Control System to Step Disturbance in Fuel Servo Input 
(Relative Mach number = 0.847; step size = 2 v [556 lb/hr]; test-signal 
amplitude, A₀ = 2.83 volts; control gain, K = 0.05; integrator time 
constant, τ = 0.2 sec.) 


This gap is caused by the recorder-channel identification system 
and coincidentally occurred at the same time as the transient was 
introduced. A very short time after the fuel flow changed, the 
controlled pressure responded to the disturbance (point C). 
Since the step disturbance was a step decrease in fuel flow, the 
controlled pressure dropped, and then responded periodically 
and approximately in phase with the fuel flow. The amplitude of 
the disturbance (556 lb/hr or 18 per cent of the steady-state 
value) was roughly equal to the peak-to-peak test-signal amplitude 
(524 lb/hr). The test-signal trace happens to be reversed in 
phase on the recording so that a deflection downward on the trace 
corresponds to deflections upward on the servo input voltage, 
fuel-valve position, fuel flow, and controlled-pressure traces. 

The controlled-pressure signal was filtered with a bandpass fil- 
ter, the output of which is shown directly above the test signal. 
Several cycles of test-signal frequency are clearly evident during 
the time interval from D to E just following the step input. 

The next trace above the bandpass-filter output is the multi- 
plier output which is the product of the test signal and the band- 
pass filter output. The double frequency or second harmonic 
produced as a result of the multiplication is also clearly evident 
in the time interval from D to E. In order to show the d-c com- 
ponent, a line has been drawn indicating the average steady-state 
value of the multiplier output before and after the transient. 
During the D to E time interval, the minimum value of the multiplier 
output signal falls along this line, indicating that the d-c component 
is one half the peak-to-peak value of the amplitude of the 
second harmonic signal produced by the multiplier. This d-c 
component is essentially constant in this interval and the integral 
action of the control causes the fuel flow to increase at a constant 
rate up to point E. At that time the d-c level decreases as the 
system approaches the final value. In steady state, the multiplier 
output oscillates plus and minus a small amount as a result of 
excursions in fuel flow to the right and to the left of the peak. The 
oscillations in controlled pressure resulting from the test signal 
are barely discernible through the noise present in the engine 
during steady-state operation. 
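
The relation between the d-c component and the second harmonic at the multiplier output can be checked numerically. The Python sketch below multiplies two sinusoids of the test-signal frequency using the multiplier scaling z = xy/100 and also evaluates the attenuation of a simple lag with a 0.1-sec time constant at the second-harmonic frequency. The input amplitudes are hypothetical, and the function names are illustrative.

```python
import math


def multiplier_output(a0, a1, phase_deg=0.0, f_test=2.0, n=2000):
    """Samples of z = x*y/100 over one test-signal period.

    x is the test signal (amplitude a0); y is the filtered pressure signal
    (amplitude a1) lagging the test signal by phase_deg degrees.
    """
    w = 2.0 * math.pi * f_test
    phase = math.radians(phase_deg)
    samples = []
    for i in range(n):
        t = i / (n * f_test)
        x = a0 * math.sin(w * t)
        y = a1 * math.sin(w * t - phase)
        samples.append(x * y / 100.0)
    return samples


def lowpass_gain(f_cps, tau=0.1):
    """Amplitude ratio of a simple lag 1/(1 + tau*p) at f_cps."""
    return 1.0 / math.sqrt(1.0 + (2.0 * math.pi * f_cps * tau) ** 2)


if __name__ == "__main__":
    z = multiplier_output(20.0, 10.0)          # hypothetical amplitudes, in phase
    dc = sum(z) / len(z)
    print("d-c component:", dc)
    print("peak-to-peak second harmonic:", max(z) - min(z))
    print("minimum of multiplier output:", min(z))
    print("simple-lag gain at the 4-cps second harmonic:", lowpass_gain(4.0))
```

When the two signals are in phase, the product never changes sign, so its minimum lies on the zero line and the d-c component is half the peak-to-peak swing, as observed on the trace.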

Effect of Control Settings on Dynamic Performance. The control 
system was subjected to a series of disturbances similar to the one 
just discussed. For each of these transients, different control 
settings were used. The responses were then analyzed to deter- 
mine the response time and per cent overshoot. These were 
measured on the fuel servo input-voltage trace to facilitate meas- 
urements and to avoid difficulties in interpretation resulting from 
pressure reversals in passing through the peaks of the engine char- 
acteristics. A line was drawn through the fuel servo input oscilla- 
tions to indicate the average value of the oscillations during the 
transient. This average value was used for the calculations of 
response time and per cent overshoot. Response time was de- 
fined as the time from the disturbance until the transient first 
completed 90 per cent of its response. Per cent overshoot was 
defined as the ratio of the first overshoot to the disturbance mag- 
nitude. 
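
These two definitions can be expressed as a small Python routine operating on the averaged servo-input trace. The sketch assumes that the trace starts at the instant of the disturbance, that the control returns the input to a known final value, and that the disturbance magnitude equals the difference between the disturbed and final values; it also takes the largest excursion beyond the final value as the first overshoot, which holds for a decaying response. The function and argument names are hypothetical.

```python
def response_metrics(times, values, start_value, final_value):
    """Return (response_time, per_cent_overshoot) for an averaged transient.

    times[0] is the instant of the disturbance, start_value is the level just
    after the step, and final_value is the level to which the control returns.
    """
    span = final_value - start_value
    direction = 1.0 if span > 0 else -1.0
    response_time = None
    for t, v in zip(times, values):
        # First instant at which 90 per cent of the response is completed.
        if direction * (v - start_value) >= 0.9 * abs(span):
            response_time = t - times[0]
            break
    # Largest excursion past the final value, in the direction of travel.
    peak_excess = max(direction * (v - final_value) for v in values)
    overshoot = 100.0 * max(0.0, peak_excess) / abs(span)
    return response_time, overshoot


if __name__ == "__main__":
    t = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
    v = [0.0, 0.5, 0.95, 1.2, 1.05, 1.0]       # made-up averaged trace
    print(response_metrics(t, v, start_value=0.0, final_value=1.0))
```

Measuring on the averaged trace, as the text describes, keeps the test-signal oscillation from triggering the 90 per cent criterion prematurely.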

Response time and per cent overshoot are shown as a function 
of integrator time constant for a fixed value of control gain (K = 
0.1) and fixed disturbance magnitude (556 lb/hr) in Fig. 6. The 
test-signal amplitude A₀ was 2.83 volts. The response time increases 
and per cent overshoot decreases as integrator time constant 
increases. This trend is as expected. The response times 
for step increases are larger than for step decreases. This trend is 
caused by the slope to the right of the peak being less steep than 
that to the left. 


Fig. 6 Effect of Integrator Time Constant on Control System 
[Relative Mach number = 0.847; step size = 2 v (556 lb/hr); test-signal 
amplitude, A₀ = 2.83 volts; control gain, K = 0.1] 


The relation between per cent overshoot and response time is 
shown in Fig. 7. The figure indicates that the minimum re- 
sponse time without overshoot is approximately 1.6 sec for step 
increases and 1.3 sec for step decreases at a relative Mach number 
of 0.847. Per cent overshoot increases rapidly below these values 
of response time. 

Effect of Shape of Static Characteristics on Dynamic Performance. 
The shape of the static characteristics varies considerably 
over the range of simulated flight conditions as shown in Fig. 1. 
At high Mach numbers the characteristics tend to curve upward 
in the region near the peak, while at low Mach numbers, the 
characteristics tend to be rounded off near the peak, with a very 
small slope to the right. 

A series of responses was obtained at the different Mach numbers 
with constant control settings and magnitude of disturbance. 
The test-signal amplitude A₀ was 2.83 volts. Also, the 
proportional part of the control was disconnected, so that the control 
action was integral only. Its effective time constant τ/K was 
4 sec. For all of the other data presented, the control was of the 
proportional-plus-integral type. Response times and per cent 
overshoot for this series are shown in Fig. 8. The response times 
for step increases are considerably larger than for the step de- 
creases, and for either direction of the step the response time 
decreases as the Mach number increases. Increasing Mach num- 
ber corresponds to moving toward the cusp-type peaks. 

There was no overshoot for any of the responses except the re- 
sponse to a step decrease at the highest Mach number. 

The reasons for the differences in response become more evident 
from a consideration of Fig. 9. This figure shows the engine gain 
or slope of the pressure-fuel flow static characteristics as a func- 
tion of fuel flow. Starting at low fuel flow, the engine gain is 
about the same magnitude and increases at about the same rate 
up to about 2400 lb per hr for all flight conditions. From 2400 lb 
per hr to the peak, the engine gain varies considerably for the dif- 
ferent Mach numbers. At high Mach numbers, the gain con- 
tinues to increase until the peak of the static characteristic is 
reached, and then changes abruptly to a large negative value. 


Fig. 7 Relation Between Per Cent Overshoot and Response Time 
(Relative Mach number = 0.847; step size = 556 lb/hr; test-signal 
amplitude, A₀ = 2.83 volts; control gain, K = 0.1.) 


Fig. 8 Effect of Mach Number on System Response 
[Step size = 2 v (556 lb/hr); test-signal amplitude, A₀ = 2.83 volts; integral 
control only; effective integrator time constant, τ/K = 4 sec] 


A short distance to the right of the peak the gain changes 
rapidly and approaches a small negative value. 

For the lowest Mach number, the slope decreases before the 
peak is reached, goes through zero at the peak, then levels off at a 
very low negative value. 

The signal, or error, to the control is proportional to the engine 
gain. The engine gain, and hence this error, is larger at higher 
Mach numbers; therefore, response times are shorter. Also, the 
error signal available on the right of the peaks is much less than 
that on the left, resulting in much slower responses for step in- 
creases than for step decreases. The response times for step in- 
creases can be made approximately the same as for step decreases 
by properly biasing the multiplier output. 

The abrupt change in engine gain from positive to negative at 
the peaks of the high Mach number engine characteristics results 
in control operation similar to that obtained with relay or on-off 
type of controls. At the lowest Mach number the engine gain 
changes gradually through zero giving control performance more 
closely related to that in a linear continuous system for small dis- 
turbances to the left of the peak. 

Fig. 9 Variation of Engine Gain at Different Mach Numbers 


Effect of Filtering on Performance. The effect of filtering was 
investigated to determine how sensitive this type of optimalizer 
control was to the large amount of noise present in the controlled 
pressure signal. 

The low-pass filter following the multiplier was removed and 
the bandpass filter upper cutoff frequency was varied. The 
steady-state response of the system is shown in Fig. 10 for various 
cutoff frequencies. In Fig. 10(a) the test-signal frequency was 
approximately 2 cps, the bandpass-filter lower cutoff frequency 
was 1 cps, and the upper cutoff frequency was 4 cps. These were 
the original settings which were used during most of the investigation. 
In Fig. 10(b), the upper cutoff frequency was raised to 40 
cps. The higher frequency components are apparent in the bandpass-filter 
output, multiplier output, fuel servo input, fuel-valve 
position, and fuel flow. 

When the upper cutoff frequency was raised to 400 cps, the 
amount of noise passing through the system was considerably 
more, as indicated in Fig. 10(c). In spite of the very large noise 
level, there was no indication of unsatisfactory performance. 

Performance for Various Test-Signal Amplitudes. In view of 
the large noise level present in the pressures in the engine, there 
was some concern as to the feasibility of operating with a reason- 
ably small test-signal amplitude. During the course of the study 
this amplitude was varied in an attempt to find a lower limiting 
value of signal amplitude that was suitable in the presence of the 
noise. It was found, however, that the noise in the system was 
not the factor which determined the minimum allowable test- 
signal amplitude. Difficulties were encountered during transients 
at the lowest test-signal amplitude. These difficulties were ap- 
parently caused by small imperfections in the pressure fuel-flow 
static characteristics. Data have indicated that in many cases 
small deviations or peaks occur in regions on both sides of the 
main peak. These usually can be attributed to pressure-profile 
effects, and many times are not apparent from ordinary plots of 
steady-state data points. When such peaks have been found, they 
have been obtained by recording pressure as a function of fuel 
flow in a continuous plot on an X-Y recorder. 

Although the system operated satisfactorily in steady state 
when the test-signal frequency was not apparent in the controlled 
pressure signal, during transients, the system would hang up or 
seek what apparently was a minor peak in the engine character- 
istic. This difficulty no longer occurred after returning to the 
larger test-signal amplitudes. 

Behavior for Higher Test-Signal Frequencies. To insure successful 
control operation, a test-signal frequency of approximately 2 
cps was used for most of the investigation. Since higher test-signal 
frequencies would permit higher filter cutoff frequencies and, 
hence, faster response, data were taken to determine the performance 
of the system under such conditions. 


Fig. 10 Effect of Filter Cutoff Frequency 
(a) Bandpass filter low cutoff frequency = 1 cps; bandpass filter high cutoff 
frequency = 4 cps; low-pass filter cutoff frequency = 1.6 cps 
(b) Bandpass filter low cutoff frequency = 1 cps; bandpass filter high cutoff 
frequency = 40 cps; low-pass filter removed 
(c) Bandpass filter low cutoff frequency = 1 cps; bandpass filter high cutoff 
frequency = 400 cps; low-pass filter removed 
(Relative Mach number = 0.847; test-signal frequency = 2 cps; test-signal 
amplitude, A₀ = 2.82 volts (262 lb/hr); control gain, K = 0.1; integrator 
time constant, τ = 0.1 sec.) 


Fig. 11 Steady-State Performance of Control for Various Test-Signal Frequencies 
(Test-signal amplitude, A₀ = 1.41 volts; control gain, K = 0.05; integrator 
time constant, τ = 0.1 sec; relative Mach number = 1.052.) 


Fig. 12 Effect of Phase Shift on Error Voltage 
(Test-signal amplitude, A₀ = 1.41 volts; relative Mach number = 1.052.) 

The steady-state behavior of the system at several test-signal 
frequencies is shown in Fig. 11. No attempt had been made to 
correct for the phase shift due to the fuel servo and engine at these 
higher frequencies. Nevertheless, the system operated satisfac- 
torily in steady state for frequencies up to 4 cps. 

The error voltage to the control for these frequencies is indicated 
in Fig. 12. As the phase shift increases from 0 to 90 deg, the error 
voltage to the control (A₀A₁/2) cos θ is reduced, and for 90-deg 
phase shift becomes zero. Operation at 90-deg phase shift there- 
fore is impossible. For frequencies which produce less than 90-deg 
phase shift, the control operates at point A, which corresponds to 
operation at peak pressure. 

When the frequency was changed from 4 to 5 cps, the operating 
point suddenly shifted from the peak to a point well to the right 
of the peak as indicated in Fig. 11. This sudden change in operat- 
ing point occurred because the phase shift increased beyond 90 deg. 
For phase shifts greater than 90 deg (and less than 270 deg), cos θ 
is negative. When cos θ is negative, the control is unstable (runs 
away) at the peaks. 
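
The dependence of the error voltage on phase shift follows from the product of two sinusoids. In the worked step below, A_0 is the test-signal amplitude, A_1 is taken to be the amplitude of the test-frequency component reaching the multiplier from the bandpass filter, and θ is the total phase lag around the loop; the pure-sinusoid form is an idealization, and multiplier scaling is ignored.

```latex
A_0 \sin \omega t \cdot A_1 \sin(\omega t - \theta)
  = \frac{A_0 A_1}{2}\left[\cos\theta - \cos(2\omega t - \theta)\right]

\text{d-c component} = \frac{A_0 A_1}{2}\cos\theta
  \;\begin{cases}
     > 0, & 0^\circ \le \theta < 90^\circ \\
     = 0, & \theta = 90^\circ \\
     < 0, & 90^\circ < \theta < 270^\circ
   \end{cases}
```

A negative d-c component reverses the sense of the correction, so the integral action drives the fuel flow away from the peak instead of toward it.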

The conditions under which the system can operate stably far 
to the right of the peaks are illustrated in Fig. 13, where the error 
voltage to the control (A₀A₁/2) cos θ is shown again. In this 
case, however, the curves are inverted because cos θ is negative. 
Stable operation at high fuel flows is possible if a small amount 
of d-c bias exists (caused for example by drift in the multiplier). 
The operating point shown in Fig. 11 for a test-signal frequency 
of 5 cps corresponds to operation at point A in Fig. 13. Increasing 
the test-signal frequency from 5 to 12 cps caused the phase 
shift to increase further. As the phase angle increased, cos θ 
increased in magnitude. The increase in error voltage to the 
control as a function of cos θ caused the operating point to shift 
as illustrated by points A and B in Fig. 13 and as shown in Fig. 11 
for frequencies from 5 to 12 cps. 


Fig. 13 Effect of Phase Shift on Engine Operating Point for 
Abnormal Operating Conditions 

The effect of component dynamics on A₁ in the term (A₀A₁/2) 
cos θ must of course also be considered. In this case, however, 
variations in A₁ are small compared to variations in cos θ. 

To alleviate the difficulties encountered here, and possibly per- 
mit operation at even higher test-signal frequencies, the test signal 
fed to the multiplier should be shifted in phase to compensate for 
the phase shift through the fuel servo, engine, and other components 
in the loop. 


CONCLUSIONS 


The experimental data indicated that the continuous test-signal 
optimalizer control, applied to the control of compressor-output 
pressure in a flight-propulsion system, gave very nearly maximum 
output pressure over a range of flight conditions. 

For a test-signal frequency of 2 cps and with a large amount of 
filtering in the system, response times of less than 2 sec were ob- 
tained with only a small amount of overshoot. These response 
times can be shortened considerably by increasing the test-signal 
frequency and decreasing the amount of filtering. 

The type of response varied greatly over the range of Mach 
numbers owing to the variation in the shape of the engine static 
characteristics from one Mach number to another. 

The data indicated that good filtering was not required. The 
system could therefore be made to operate faster by removing part 
or all of the filtering. 

The minimum test-signal amplitude was found to be dictated 
not by the signal-to-noise ratio, but by imperfections or minor 
peaks in the engine static characteristics. For very small test- 
signal amplitudes, the control would hang up on a minor peak. 


Discussion 


Y. T. Li.² The author is to be congratulated on this excellent 
paper. The presentation of the problem is well done and the experimental 
coverage is quite complete. An optimalizing system, 
like any control system, must prove its practicability by being 
subjected to the disturbance environment the system is likely to 
encounter. The extensive study of the noise effects upon the 
operation of the tested system fulfills this requirement quite well. 
The author also has established the response speeds of the system 
at different operating conditions. To be more inquisitive, the 
writer would like to ask this question: How does this response 
speed compare with the actual drift speed of the operating condition 
of the engine under flight conditions? 

² Department of Aeronautical Engineering, Massachusetts Institute 
of Technology, Cambridge, Mass. 

In so far as the basic purpose of the control system is con- 
cerned, the use of the engine pressure ratio as the output, and the 
fuel flow rate as the controlled input, indicates that the goal is to 
maintain a maximum delivery of thrust without the consideration 
of fuel consumption. Clearly, if the economical use of fuel is the 
primary purpose, then the output might have been the ratio of the 
pressure ratio divided by the fuel rate. 

To get an optimum operation we usually have two possible 
schemes; i.e., program-type control and an optimalizing control. 
The final choice between the two depends upon the relative com- 
plexity of the system. The writer wishes to know whether the 
author has considered the possibility of getting similar control per- 
formance with a programmed-type controller, and whether it is 
more complicated to attempt this? 

In so far as an optimalizing controller is concerned, a continuous 
test signal is a neat system and has a lot of appeal to electronic 
systems operating at many thousands of cycles per second. For 
mechanical systems with hunting periods down to the order of a 
second, it is the opinion of the writer that a peak-holding type 
might be considerably more simple and flexible. 


Author's Closure 


The author appreciates the complimentary remarks and excellent 
discussion offered by Dr. Li. It is a privilege to have one of 
the inventors of optimalizing controls discuss this paper. 

Dr. Li asks several rather interesting questions. In his first 
question he asks how the response speeds of the system, as 
investigated, compare with the actual drift speed of the operating 
condition of the engine under flight conditions. 

The answer to this question will depend upon the particular 
application of the engine. For certain vehicles and missions, the 
response times given are adequate. For others, the speed of response 
is not so fast as desired. The response speeds shown, 
however, are for the system with a large amount of filtering. It 
was shown that the system would operate with all of the filtering 
removed. Since the response time would improve considerably 
as the filtering is reduced, it is the author’s opinion that response 
times could be obtained with an optimalizing control system 
which would approach the best that can be obtained with stand- 
ard nonoptimalizing control systems. Once the filtering has been 
removed, the author feels that the basic limitations on speed of 
response of the two types of systems are the same. (Actually, it 
is not suggested that all of the filtering be removed. How much 
filtering exists, either intentional or unavoidable, will again de- 
pend upon the specific application and on the hardware employed. 
Nevertheless, it is felt that the response times shown can be considerably 
improved.) 

The second question involves the variable being optimalized. 
Apparently, the author’s use of pressure ratio was confusing. 
The quantity actually sensed, as shown in Fig. 2, was an engine 
pressure, P,. The plots, however, were made in terms of pres- 
sure ratio for convenience. 

The economical use of fuel was not discussed in the paper and 
Dr. Li’s observations in this regard are well made. It turns out, 
however, that minimum specific fuel consumption occurs at or 
near maximum pressure and therefore use of the ratio of pressure 
to fuel flow as the output is not necessary in this case. 

Programmed-type controllers have been used successfully in 
many applications. A program-type control depends upon the 
calibration of the engine to get the required program. As the 
number of variables (Mach number, altitude, angle of attack, 
etc.) which affect engine operation increases and as the range of 
these variables increases, the calibration becomes more difficult 
to obtain and also less reliable. In addition, changes occur in the 
engine after the calibration has been made. 

It is the author’s opinion that an optimalizing control could be 
made which is less complicated and more reliable than many of 
the existing engine controls. Operations such as filtering, multi- 
plication, and signal generation can be performed by relatively 
simple devices which can be mechanical, electrical, electronic, 
acoustic, or other types. Utilizing such devices, simple con- 
tinuous test-signal optimalizing controllers can be designed to 
operate over a wide range of frequencies. Although the peak- 
holding type might be more simple and flexible, it would very 
likely not be so insensitive to noise as the continuous test-signal 
type. 


Representation of Nonlinear Functions of 
Two Input Variables on Analog Equipment 


By D. A. ELLIOTT,¹ CALDWELL, N. J. 


When an analog operator who has been solving problems 
on a linearized basis begins to employ nonlinear com- 
ponents to represent a wide range of operation of the simu- 
lated system, it soon becomes apparent that many of the 
key relations in the system are nonlinear functions of two 
or more input variables. Function-generating equipment 
that will develop nonlinear functions of a single input 
variable is now available for many types of analog equip- 
ment, but equipment that will generate directly three- 
dimensional families of curves as arbitrary nonlinear func- 
tions of two input variables exists only in a few cases of 
highly expensive specialized equipment. Some methods 
also have been described for developing three-dimensional 
curves by reworking standard function-generating equip- 
ment to perform the additional mathematical operations 
required by a second input variable. This paper describes 
a graphical method of matching three-dimensional func- 
tions by the use of standard analog components without 
modifications. 


INTRODUCTION 


The method which is described here uses standard analog 
components without any modifications for accomplishing 
the simulation of three-dimensional functions. A graphi- 
cal method of matching the desired family of curves is used to 
determine the necessary analog relations. This provides a simple 
method having reasonably good accuracy, which is readily adapta- 
ble to a large variety of functions. Although the method de- 
pends upon a somewhat regular progression among the lines in a 
family of curves, most functions do have such a progression. 
With this method it has been possible to match all curves which
have been encountered in the simulation of several types of ram- 
jet and turbojet engines with sufficient accuracy to achieve the 
aim of analog computation; i.e., the study of dynamic-stability 
characteristics in control-analysis problems. For such studies, 
the principal requirement is that the correct slopes, or partial 
derivatives, be present to relate the variables at any particular 
operating point. 


BASIC PROCEDURES


In order to describe this method, it is first necessary to define 
the terms to be used. The familiar procedure of using delta 
increments is useful for nonlinear functions in a manner similar to 
that of linear-analog systems, where voltages represent excursions 
about a steady-state point. When used with nonlinear systems 
this permits greater accuracy, since the range of variation of each 
variable can be made to cover the full range of variation of available
voltage. An illustration of the delta method as applied to a
fixed coefficient is provided by Fig. 1. This relation is valid for
either linear or nonlinear analog representations, and serves to
define the terms which will be used.


1 Project Engineer, Curtiss-Wright Corporation, Propeller Division.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters,
January 19, 1956. Paper No. 56-IRD-11.


FIG. 1 FIXED COEFFICIENT

In a typical linear analog representation, a steady-state operat- 
ing point of the simulated system is selected, and the variables in 
the system are mutually related by the fixed coefficients and 
dynamic relations that exist at that point. The steady-state 
point is normally zero volts for all variables. Excursions of the 
variables in response to applied disturbances occur as positive or 
negative delta amounts representing variations above or below 
the defined steady-state point. Take, for example, two variables 
$x$ and $y$ which are related by a coefficient $C$. If the initial
steady-state points for these variables are constant terms defined as $x_0$
and $y_0$, then

$$x = x_0 + \Delta x, \qquad y = y_0 + \Delta y$$

where $\Delta x$ and $\Delta y$ are the excursions about the steady-state point.
Relating the variables by the coefficient $C$, the following relations
exist

$$y_0 + \Delta y = C(x_0 + \Delta x) \qquad [5]$$


Various methods may be used to apply scale factors which will
express the variables in terms of analog voltages. A typical
method uses scale units which define the amount of each variable
per volt on the analog. The units are defined as constant terms
$u_x$ and $u_y$ which relate the delta excursions to analog voltages
according to the expressions

$$\frac{\Delta x}{u_x} = v_x \qquad [7]$$

$$\frac{\Delta y}{u_y} = v_y \qquad [8]$$

In these equations $v_x$ and $v_y$ are variable voltages representing
$\Delta x$ and $\Delta y$ on the analog equipment. The value of each scale
unit is determined by the amount of delta excursion which is to be
represented by the maximum analog voltage. The dimensions of
the scale units will be determined by the dimensions of the corresponding
variables. For example, if variable $x$ has dimensions of
rpm, then scale unit $u_x$ has dimensions of rpm/volt.

In using the scale units to calculate the gain settings for analog
amplifiers, the significant relations from Equations [6], [7], and
[8] are

$$u_y v_y = C u_x v_x \qquad [9]$$

$$v_y = C \frac{u_x}{u_y} v_x \qquad [10]$$

$$v_y = G v_x \qquad [11]$$

$$G = C \frac{u_x}{u_y} \qquad [12]$$

The term $G$ is the gain value which must be set on an analog
amplifier to represent the coefficient $C$ with scale factors included.

FUNCTION OF A SINGLE VARIABLE 


A nonlinear function of a single input variable represents a 
coefficient that varies with the magnitude of the input. Fig. 2 
illustrates such a function. Several methods are available for 
matching this type of curve on analog equipment. These include
servo-operated devices to move pickup resistors across wire
which has been cemented to a graph, photoformers using a mask
on the face of a cathode-ray tube, function-fitter components
using diode circuits to make curves consisting of several straight-line
segments, and polynomial equations to express the curve
mathematically. The principles which are used here apply to
any of these methods but are particularly intended for the function
fitters which cover the full range of both positive and negative
analog voltages. By using this full range of voltage to cover
the expected excursion of each variable, the best accuracy is
obtained from the analog equipment.

FIG. 2 FUNCTION OF ONE VARIABLE

To apply a curve of this type to the analog, a reference point is
selected on the curve at the approximate center of the expected
range of variation. The values of the variables at this point are
the reference values $x_0$ and $y_0$. Values for the scale units are
determined in the same manner as the fixed coefficient, by the
amount of excursion which is to be represented for each variable
divided by the maximum analog voltage. Once the scale units
have been selected, voltage axes may be applied to the plotted
curve with their origin at the reference point. The voltage
values are determined from the relations

$$v_x = \frac{x - x_0}{u_x} \qquad [13]$$

$$v_y = \frac{y - y_0}{u_y} \qquad [14]$$

Once the voltage values have been applied to the curve, scaling 
has been completed for the analog. When the function-fitting 
component is set up to represent the plotted curve in terms of 
volts, the correct system relation will be obtained in the analog 
circuit. 
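
A minimal sketch of this scaling step follows, assuming a hypothetical curve and a 100-volt full-scale range (neither is taken from the paper). Each plotted point is re-expressed in volts about the reference point, which is the form in which it would be set on a function fitter.

```python
def to_volts(value, reference, unit):
    """Voltage representing a real value measured from its reference value."""
    return (value - reference) / unit


def curve_in_volts(points, x0, y0, u_x, u_y):
    """Relabel (x, y) points of a plotted curve as (v_x, v_y) pairs."""
    return [(to_volts(x, x0, u_x), to_volts(y, y0, u_y)) for x, y in points]


# Hypothetical curve y = x**2 / 100 sampled over 0..100 (illustrative only).
points = [(float(x), x * x / 100.0) for x in range(0, 101, 25)]
x0, y0 = 50.0, 25.0      # reference point near the center of the range
u_x = 50.0 / 100.0       # 50 units of x excursion spread over 100 volts
u_y = 75.0 / 100.0       # largest y excursion (100 - 25) spread over 100 volts
volts = curve_in_volts(points, x0, y0, u_x, u_y)
```

The reference point maps to zero volts on both axes, and the largest excursion of each variable lands on the assumed full-scale voltage.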


TRANSLATION OF CURVES 


Plots of nonlinear functions of two input variables generally 
show a similarity or regular progression among the lines in the 
family of curves. In some cases this appears to be a translation 
or shifting of the curves, while in others a rotation, or fanning out 
of the curves is apparent. Procedures for accomplishing such 
translations and rotations on analog equipment make possible the 
matching of families of curves in a simple graphical manner. 

The most frequently encountered case is that of translation. 
The analog mechanism for causing such shifts is to add voltages 
to the input and output of the function-fitting component which 
generates the basic shape of the curve. 

Vertical Shifting. The effect of adding a voltage to the output 
of a function is illustrated by the plot in Fig. 3. The obvious 
result is that the curve is displaced vertically by an amount equal 
to the added voltage. 


FIG. 3 VERTICAL SHIFTING


Horizontal Shifting. The effect of adding a voltage to the input
of a function is illustrated in Fig. 4. In this case the curve is
displaced horizontally, by an amount opposite in sign to the
added voltage. A positive voltage added to the input causes the
curve to shift to the left, or negative direction.

Combining Horizontal and Vertical Shifts. Since a mechanism 
is available for causing translation of curves in any desired direc- 
tion, it is only necessary to control the amount of translation in 
response to a second input variable in order to match families of 
curves which are nonlinear functions of two input variables. 

Fig. 5 illustrates the circuit required to produce combined
horizontal and vertical translation in response to a second input
variable. Two arrangements are shown: A block diagram, and a
typical analog representation. In the latter the change of sign
produced by each analog amplifier is included. This circuit
develops a voltage $v_z$ which is related to one input $v_x$ by the
function

$$f_1(v_x)$$

and is translated horizontally and vertically by the second variable
input voltage $v_y$. The horizontal shifting function is $f_2$ and
the vertical shifting function is $f_3$. Notice that in the analog diagram,
the horizontal function need not be expressed in the negative
sense since the reversal in sign of the analog amplifier causes
the necessary negative action.
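
The combined translation can be read as a composition of the three functions. The sketch below is an illustration under stated assumptions: the sample shapes chosen for f1, f2, and f3 are invented, and f2 is taken here as the displacement of the curve to the right, so that the function generator sees v_x minus f2(v_y). That subtraction stands in for the sign reversal which the text attributes to the analog amplifier.

```python
def translated_family(f1, f2, f3):
    """Build v_z(v_x, v_y) from a basic curve f1 and two shifting functions.

    f2(v_y): horizontal displacement of the curve (positive = to the right).
    f3(v_y): vertical displacement of the curve (positive = upward).
    """
    def v_z(v_x, v_y):
        return f1(v_x - f2(v_y)) + f3(v_y)
    return v_z


# Invented sample shapes, for illustration only.
basic_curve = lambda v: 0.01 * v * v     # f1: shape of the reference line
horizontal = lambda v_y: 0.5 * v_y       # f2: shift right by half of v_y
vertical = lambda v_y: 0.25 * v_y        # f3: shift up by a quarter of v_y

v_z = translated_family(basic_curve, horizontal, vertical)
on_reference_line = v_z(30.0, 0.0)       # v_y = 0 leaves the basic curve unshifted
shifted_vertex = v_z(10.0, 20.0)         # vertex has moved to v_x = 10, v_z = 5
```

At v_y = 0 the output is the basic curve itself, and for any other v_y every point of that curve is moved by the same horizontal and vertical amounts.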


FIG. 4 HORIZONTAL SHIFTING

FIG. 5 COMBINED HORIZONTAL AND VERTICAL TRANSLATION (block diagram and analog diagram)


Determination of Shifting Functions. The method of deter- 
mining the horizontal and vertical shifting functions is the most 
important step, and greatly influences the degree of accuracy 
which is obtained. To illustrate the procedure, the plot shown 
in Fig. 6 provides a good example of a three-dimensional function 
such as may be found in turbojet-compressor relations. As with 
the function of a single variable, a reference point is selected in the 
approximate center of the family of curves. All variables have 
zero voltage at this point. It is desirable, but not essential, that 
this point fall on one of the plotted curves. This line then represents
zero volts for $v_y$, the second variable.

A transparent overlay having a single basic curve is plotted so 
that by successive shifts, it can be made to match each curve of 
the family. As the curves become steeper on the right side of the 
plot, they match the lower portion of the basic curve. In the less 
steep region of the left side, they match the upper portion of the 
basic curve. In matching the curves, care must be taken to keep 
the grid lines of the overlay parallel with the grid lines of the 
function being matched. 


FIG. 6 TYPICAL FUNCTION

FIG. 7 SHIFTING FUNCTIONS (horizontal shift $f_2$ and vertical shift $f_3$)


To determine the shifting functions, the reference point is 
marked on the basic curve of the overlay. As this curve is 
shifted to each line of the family of curves, the point to which the 
reference point has moved is marked on each line. These points 
define the required amount of vertical and horizontal shift. As 
shown in Fig. 7, the amount of horizontal shift may be plotted
as a variation of input voltage $v_x$ for each increment of $v_y$ to form
$f_2$; and similarly the vertical shift $f_3$ is plotted as the variation of
$v_z$ corresponding to each increment of $v_y$. The shifting functions
generally are not elaborate curves, and frequently can be matched 
by lines having two or three straight-line segments which require 
only simple limiter or diode circuits. Undesired nonlinearities 
obtained in first plots of the shifting functions can often be 
eliminated without undue sacrifice of accuracy by a second 
matching of the overlay to the curves, with judicious weighting 
in the desired direction where necessary. 

When the basic curve of the overlay is applied to the analog 
circuit as $f_1$, and horizontal and vertical shifting curves are
applied as $f_2$ and $f_3$, the desired three-dimensional relation will be
generated. Interpolation between lines of the family of curves 
will follow the shapes of the shifting functions. 
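
As a hedged illustration of the last two paragraphs, the shift points marked on the overlay can be turned into straight-line-segment shifting functions. Everything in the sketch below is hypothetical: the shift-point values are invented, the helper name `piecewise_linear` is not from the paper, and holding the end values constant beyond the outermost marks is an assumption standing in for a limiter circuit.

```python
from bisect import bisect_right


def piecewise_linear(breaks, values):
    """Straight-line segments through (breaks[i], values[i]); end values are held."""
    def f(v):
        if v <= breaks[0]:
            return values[0]
        if v >= breaks[-1]:
            return values[-1]
        i = bisect_right(breaks, v) - 1          # segment containing v
        t = (v - breaks[i]) / (breaks[i + 1] - breaks[i])
        return values[i] + t * (values[i + 1] - values[i])
    return f


# Invented shift points: (v_y of the line, horizontal shift, vertical shift), in volts.
shift_points = [
    (-60.0, -30.0, -40.0),
    (-20.0, -12.0, -10.0),
    (0.0, 0.0, 0.0),          # reference line: no shift
    (20.0, 15.0, 8.0),
    (60.0, 50.0, 20.0),
]
v_y_marks = [p[0] for p in shift_points]
f2 = piecewise_linear(v_y_marks, [p[1] for p in shift_points])
f3 = piecewise_linear(v_y_marks, [p[2] for p in shift_points])

between_lines_h = f2(10.0)    # halfway between the 0 V and 20 V lines
between_lines_v = f3(40.0)    # halfway between the 20 V and 60 V lines
```

Values of v_y that fall between two plotted lines receive shifts interpolated along the segments, which is the interpolation behavior the text describes.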


ROTATED CURVES


Families of curves which show a definite angular change be- 
tween lines, or fan out from the origin, may best be represented by 
including rotation in the analog representation. This requires 
the use of multiplier components in one of several manners. 

Direct Multiplication. In order to describe the analog methods
for simulating curves which include multiplication, it is first
necessary to establish the procedure for handling multiplication
when the delta and reference method is used for expressing variables.
A graphical representation of multiplication is shown in
Fig. 8. This plot illustrates the first quadrant of the product $xy$
and consists of a series of straight lines radiating from the origin.
Reference lines $x_0$ and $z_0$ also are plotted to indicate the zero-voltage
axes of the analog simulation.


FIG. 8 MULTIPLICATION


Although the reference, or zero-voltage, point of the problem is 
displaced from the real origin when using the reference and delta 
method, multiplication of one variable by another can be accom- 
plished easily. The process which is used improves the accuracy, 
since a portion of the computation is performed by normal linear 
equipment so that any inaccuracies in multiplier components are 
minimized. The method is illustrated by the following example: 

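Real equation

$$z = xy \qquad [16]$$

In reference form

$$(z_0 + \Delta z) = (x_0 + \Delta x)(y_0 + \Delta y) \qquad [17]$$

This becomes

$$z_0 + \Delta z = x_0 y_0 + x_0 \Delta y + y_0 \Delta x + \Delta x \Delta y \qquad [18]$$

Eliminating the constant values

$$z_0 = x_0 y_0 \qquad [19]$$

results in the expression

$$\Delta z = x_0 \Delta y + y_0 \Delta x + \Delta x \Delta y \qquad [20]$$


FIG. 9 MULTIPLICATION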


The analog circuit required for multiplication is shown in the 
block diagram of Fig. 9. 
In analog form the relations become

$$u_z v_z = x_0 u_y v_y + y_0 u_x v_x + (u_x v_x)(u_y v_y) \qquad [21]$$

$$v_z = \frac{x_0 u_y}{u_z} v_y + \frac{y_0 u_x}{u_z} v_x + \frac{u_x u_y}{u_z} v_x v_y \qquad [22]$$

The voltage $v_z$ is thus the summation of three terms. The first
two require only linear coefficients, while the third requires a
multiplication as well as a coefficient.
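
The three-term form can be verified against the real product. In the sketch below the function name and all numerical values are assumptions chosen for illustration; the check is simply that the output voltage, converted back through its scale unit and reference value, equals the product of the real variables.

```python
def multiply_analog(v_x, v_y, x0, y0, u_x, u_y, u_z):
    """Output voltage v_z for z = x * y in delta-and-reference form."""
    linear_in_y = (x0 * u_y / u_z) * v_y           # fixed coefficient on v_y
    linear_in_x = (y0 * u_x / u_z) * v_x           # fixed coefficient on v_x
    product = (u_x * u_y / u_z) * v_x * v_y        # multiplier term
    return linear_in_y + linear_in_x + product


# Assumed reference values and scale units (illustrative only).
x0, y0 = 4.0, 5.0
z0 = x0 * y0
u_x, u_y, u_z = 0.02, 0.03, 0.2

v_x, v_y = 50.0, -40.0
v_z = multiply_analog(v_x, v_y, x0, y0, u_x, u_y, u_z)

x = x0 + u_x * v_x          # 5.0
y = y0 + u_y * v_y          # 3.8
z_from_analog = z0 + u_z * v_z
```

For these numbers the two linear terms contribute -24 and +25 volts and the multiplier term only -6 volts, which illustrates the remark that most of the computation is carried by linear equipment.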

Division. By slight revisions in the arrangement of terms, 
division also can be accomplished in a very simple manner. 

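For a real equation of the form

$$z = \frac{x}{y} \qquad [23]$$

the analog equation becomes

$$v_z = \frac{u_x}{u_z y_0} v_x - \frac{z_0 u_y}{u_z y_0} v_y - \frac{u_y}{y_0} v_y v_z \qquad [24]$$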


This is again a summation of two terms using linear coefficients 
and a third using a coefficient and a multiplier. 
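
A hedged numerical check of the division arrangement follows. It assumes that the real equation is the quotient z = x/y and that the product of v_y and the output v_z is fed back through the multiplier; the function name, the iteration used to imitate the feedback loop, and all numbers are illustrative assumptions rather than the paper's circuit.

```python
def divide_analog(v_x, v_y, x0, y0, u_x, u_y, u_z, iterations=200):
    """Output voltage v_z for z = x / y, found by iterating the feedback relation."""
    z0 = x0 / y0
    a = u_x / (u_z * y0)           # linear coefficient on v_x
    b = z0 * u_y / (u_z * y0)      # linear coefficient on v_y
    c = u_y / y0                   # coefficient on the multiplier term v_y * v_z
    v_z = 0.0
    for _ in range(iterations):    # settles when |c * v_y| < 1, i.e. |delta_y| < y0
        v_z = a * v_x - b * v_y - c * v_y * v_z
    return v_z


# Assumed reference values and scale units (illustrative only).
x0, y0 = 20.0, 5.0
z0 = x0 / y0
u_x, u_y, u_z = 0.1, 0.02, 0.02

v_x, v_y = 30.0, 50.0
v_z = divide_analog(v_x, v_y, x0, y0, u_x, u_y, u_z)

x = x0 + u_x * v_x          # 23.0
y = y0 + u_y * v_y          # 6.0
z_from_analog = z0 + u_z * v_z
```

The converged output corresponds to 23/6, the quotient of the real variables, so the rearranged terms are consistent with the quotient under the stated assumption.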

Multiplication of a Single Nonlinear Function. The most 
obvious and direct type of three-dimensional function is one in 
which a nonlinear function of one variable is directly multiplied 
by a second variable as shown in Fig. 10. A three-dimensional 
function of this nature is described by the equation 


$$z = C\,y\,f(x) \qquad [25]$$


FIG. 10 MULTIPLICATION OF SINGLE NONLINEAR VARIABLE


To perform this operation on the analog, the method for develop- 
ing a single nonlinear function is applied to the first variable. 
The output of this computation becomes one of the inputs to the 
multiplying arrangement which has been described. The second 
variable is applied to the other multiplier input through a suit- 
able coefficient to produce the necessary angular variation be- 
tween lines in the family of curves. 

General Method for Rotated Functions. Many functions that 
involve rotation can be matched by generating a series of rotated 
straight lines by means of multiplication, and adding a nonlinearity
to them. An example of such a function is illustrated in
Fig. 11. To determine the analog relations for matching curves 
of this type, an overlay method is again used. By shifting and 
rotating the overlay, a series of radiating lines is established 
which defines the necessary multiplication. The basic nonlinear 
function, and the necessary shift and rotation functions are deter- 
mined by the relation of these radiating lines to the desired family 
of curves. The following steps establish the multiplication and 
matching functions in voltage terms for application to analog 
equipment. 

Basic Nonlinear Function, $f_1$. In a three-dimensional function,
$z = f(x, y)$, establish a reference point on one basic line in the
approximate center of the family of curves, and assign voltage
scales to each variable. All voltages are zero at the reference
point.


FIG. 11 ROTATED FUNCTION


Draw a tangent to the basic line at the reference point. The
difference between the basic line and the tangent in terms of
voltage $v_z$ at each value of $v_x$ defines the basic nonlinear function,
$f_1$.


Center of Rotation and Shift Points. Copy the basic line with
the reference point and the tangent line on a transparent overlay.
Shift and rotate the overlay to match each line of the family of
curves. Establish a point of intersection for the tangents which
will satisfy all curves. This is the "center of rotation" for the
multiplication.

On each line of the family of curves, mark the point where the
reference point of the overlay falls when matched. These are
"shift points."

Horizontal Shifting Function, $f_2$. A horizontal shifting function
is established from the horizontal magnitude of each "shift
point" expressed as a change of $v_x$ for each value of $v_y$. As in the
case of translation, this function is negative in sign.

Multiplication Equation. Establish, in terms of the variables
$x$ and $z$, shifted scales which have the same units but originate at
the "center of rotation." Magnitudes on these scales are designated
$x_r$ and $z_r$.

Each "shift point" will now have co-ordinates in terms of $x_r$
and $z_r$. Those for the reference point will be $x_{r0}$ and $z_{r0}$.

Solve for $y_r$, the value of the tangent at each shift point from
the expression

$$y_r = \frac{z_r}{x_r} \qquad [27]$$

At the reference point, the tangent will be

$$y_{r0} = \frac{z_{r0}}{x_{r0}} \qquad [28]$$


Rotation Function, $f_4$. Multiplication about the "center of
rotation" is now defined. Since this is not necessarily proportional
to the second input voltage $v_y$, a rotation voltage $v_r$ is
established. This is related to $v_y$ by a function $f_4$ in a manner
similar to the vertical shifting function used in translation, and is
found as follows:

Establish a scale unit $u_r$ for the rotation voltage. This unit is
based on the maximum excursion of the voltage $v_r$ and the value
of the tangent $y_r$ at that condition, using the expression

$$u_r = \frac{y_{r\,\max} - y_{r0}}{v_{r\,\max}} \qquad [29]$$

Determine the voltage $v_r$ corresponding to the tangent $y_r$ for
each line of the family of curves from the expression

$$v_r = \frac{y_r - y_{r0}}{u_r} \qquad [30]$$

Plot $v_r$ versus $v_y$ to obtain the rotation function $f_4$.
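
The steps for the rotation voltage can be sketched as follows. This is an illustration under stated assumptions, not the paper's worked example: the shift-point co-ordinates are invented, the tangent is taken as the ratio of the z and x co-ordinates measured from the center of rotation, and the scale unit is set from the largest slope deviation on an assumed 100-volt full scale.

```python
def rotation_voltages(shift_points, reference, v_r_max=100.0):
    """Return the rotation scale unit and the rotation voltage for each line.

    shift_points, reference: (x_r, z_r) pairs measured from the center of rotation.
    """
    y_r0 = reference[1] / reference[0]                 # tangent at the reference point
    slopes = [z_r / x_r for x_r, z_r in shift_points]  # tangent at each shift point
    largest = max(abs(y_r - y_r0) for y_r in slopes)   # assumed basis for the scale unit
    u_r = largest / v_r_max
    return u_r, [(y_r - y_r0) / u_r for y_r in slopes]


# Invented co-ordinates for three lines of a family (illustrative only).
reference = (10.0, 5.0)
shift_points = [(9.0, 3.6), (10.0, 5.0), (11.0, 7.7)]
u_r, v_r = rotation_voltages(shift_points, reference)
```

Pairing each resulting rotation voltage with the second-input voltage of its line gives the points through which the rotation function would be drawn.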
Analog Arrangement for Rotation. Fig. 12 illustrates the analog
circuit required to represent a function of two input variables
involving rotation. The multiplication is in terms of one input
voltage $v_x$ and the rotation voltage $v_r$ in accordance with the
equation

$$v_{zm} = \frac{y_{r0} u_x}{u_z} v_x + \frac{x_{r0} u_r}{u_z} v_r + \frac{u_x u_r}{u_z} v_x v_r \qquad [31]$$

The total output voltage $v_z$ is the summation of the voltages
developed by function $f_1$ and the multiplication

$$v_z = v_{z1} + v_{zm} \qquad [32]$$

The additional shifting and rotation functions $f_2$ and $f_4$ are
included at the appropriate points to add to $v_x$ and develop $v_r$ in
response to the second input voltage $v_y$.


FIG. 12 ANALOG ARRANGEMENT FOR ROTATED FUNCTIONS OF TWO INPUT VARIABLES


APPLICATIONS 


By using the principles of translation or rotation which have 
been developed here, some combination can be obtained to match 
almost any nonlinear function of two input variables which has 
reasonably uniform progression among the lines. Using these 
methods, it has been possible to match turbojet-compressor 
curves within approximately 1/2 per cent of full scale within the
working range, with the error increasing to the order of 2 per cent
at the limits of the function. The accuracy of the simulation can 
be checked graphically before applying the circuits to the analog 
equipment. 

Since only standard analog equipment is used, the circuits may 
be drawn in a normal manner once the constants have been deter- 
mined. Interconnections among analog components can be 
made easily from the circuit diagrams by anyone familiar with 
analog connections, since no special revisions to components are 
required. By using all-electronic function generators and multi- 
pliers, no objectionable dynamics are added to the problem. 

Extension of these methods to functions of three input vari- 
ables is also possible. If such a function can be plotted as a 
series of families of curves, the transitions between families can 
be determined. Suitable shifts can then be applied in much the
same manner as used to convert functions of a single variable to
functions of two. Obviously the amount of equipment required
and the complexity of the problem increase when an additional
input variable is introduced.


Discussion 


E. S. SHERRARD.2 Other methods than those described in the
paper may be used for generating a function of two variables
with electronic components. Methods for handling quite arbitrary
functions of two variables have been described by Jerrard
and Jacobi,3 and by Philbrick.4 The isocline method of Meissinger5
is useful for generating a particular class of functions of
two variables. Since the equipment used by the author is
simple, cheap, and much more frequently available than is the
equipment used in Meissinger's method, the writer would appreciate
his comment on the following:


1 What functions of two variables may be generated by 
both his method and Meissinger’s method? 

2 What functions of two variables, if any, may be easily 
generated by his method but are difficult or impossible to gener- 
ate by Meissinger’s method? 

3 What functions of two variables, if any, are difficult or im- 
possible to generate by his method, but are easily generated by 
Meissinger’s method? 


The author's method is capable of generating exactly or approximately
the family of curves $z(x, y_i)$, according to whether
Equations [33] and [34] or Equations [35] and [36] are exactly
or approximately satisfied

$$z(x, y_i) = f_1(w) + f_3(dy_i) \quad \text{(see Fig. 5)} \qquad [33]$$

$$w = x + f_2(dy_i), \qquad dy_i = y_i - y_0 \qquad [34]$$

$$z(x, y_i) = f_1(w) + x f_4(dy_i) \quad \text{(see Fig. 11)} \qquad [35]$$

$$w = x + f_2(dy_i), \qquad dy_i = y_i - y_0 \qquad [36]$$

where the "reference function" $f_1(w)$ is taken equal to $z(x, y_0)$, $y_0$
being the reference value of $y$, the parameter of the family. In
the foregoing equations $f_2(dy_i)$ is the horizontal shift function and
$f_3(dy_i)$ is the vertical shift function illustrated in Fig. 7 of the
paper. The author's method of determining these shift functions
is a cut-and-try procedure which requires the drawing of an
overlay of $f_1(w)$. Such a method may be the best approach if
Equations [33] and [34] or [35] and [36] are satisfied approximately
rather than exactly. If these equations are not satisfied
approximately, the cut-and-try procedure may be both tedious
and ineffective.

2 Analog Systems Section, Data Processing Systems Division,
U. S. Department of Commerce, National Bureau of Standards,
Washington, D. C.

3 "Generation of a Function of Two Variables," by R. P. Jerrard
and G. T. Jacobi, presented at the Annual Conference of the Association
for Computing Machinery, Cambridge, Mass., September,

4 "Continuous Electric Representation of Non-Linear Functions
of n-Variables," by G. A. Philbrick, A Palimpsest on the Electronic
Analog Art, G. A. Philbrick Researches, Inc., 1955.

5 "An Electronic Circuit for the Generation of Functions of Several
Variables," by H. F. Meissinger, Institute of Radio Engineers Spring
Meeting, 1955.


A simple test for the satisfaction of Equations [33] and [34]
may be made by using the first differences with respect to $x$ of
$z(x, y_i)$ and the first differences of $f_1(w)$ with respect to $w$. If
Equation [33] is satisfied, then differentiation of Equation [33]
yields

$$\frac{\partial z(x, y_i)}{\partial x} = \frac{d f_1(w)}{dw} \qquad [37]$$

Equation [37] will be satisfied for corresponding values of $w$ and
$x$, $(w_1, x_1)$, $(w_2, x_2)$, $(w_3, x_3)$, etc. The values of $x_1$, $x_2$, $x_3$, etc.,
corresponding to chosen values of $w_1$, $w_2$, $w_3$, etc., may be found
by determining from the difference tables of $z(x, y_i)$ the values
of $x$ at which the first differences of $z(x, y_i)$ are equal to the first
difference of $f_1(w)$. Then, if Equation [34] is satisfied $f_2(dy_i)$
is given by

$$f_2(dy_i) = w_1 - x_1 = w_2 - x_2 = w_3 - x_3, \text{ etc.}$$

Also, for Equation [33]

$$f_3(dy_i) = z(x_1, y_i) - f_1(w_1) = z(x_2, y_i) - f_1(w_2), \text{ etc.}$$

The constancy of $f_2(dy_i)$ and $f_3(dy_i)$ as $w$ and $x$ assume the values
of $(w_1, x_1)$, $(w_2, x_2)$, $(w_3, x_3)$, etc., determines whether or not Equations
[33] and [34] are good or poor approximations for the given
function $z(x, y_i)$.
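
The slope-matching test lends itself to a short numerical sketch. The one below is illustrative and rests on assumptions: the reference curve and the family member are synthetic (a parabola shifted by known amounts), slopes are taken by central differences rather than from difference tables, and the matching abscissa is located by bisection, which presumes that the slope of the curve increases steadily over the search interval.

```python
def slope(g, t, h=1e-5):
    """Central-difference estimate of dg/dt."""
    return (g(t + h) - g(t - h)) / (2.0 * h)


def matching_abscissa(g, target_slope, lo, hi, iterations=60):
    """Bisection for the x in [lo, hi] at which slope(g, x) equals target_slope.

    Assumes the slope of g increases monotonically on [lo, hi].
    """
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if slope(g, mid) < target_slope:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def shift_estimates(f1, z_i, w_values, lo, hi):
    """For each chosen w, return (w - x, z_i(x) - f1(w)) at the slope-matched x."""
    estimates = []
    for w in w_values:
        x = matching_abscissa(z_i, slope(f1, w), lo, hi)
        estimates.append((w - x, z_i(x) - f1(w)))
    return estimates


# Synthetic family member: the reference parabola moved 3 units left and 2 units up.
f1 = lambda w: w * w
z_i = lambda x: f1(x + 3.0) + 2.0
estimates = shift_estimates(f1, z_i, [1.0, 2.0, 4.0], -10.0, 10.0)
```

For this synthetic curve every chosen w returns the same pair of shifts, 3 and 2. Constancy of this kind is the criterion stated above; estimates that drift from one w to the next would indicate a poor fit.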

If $z(x, y_i)$ is satisfied by Equations [35] and [36], the second
differences of $z(x, y_i)$ and $f_1(w)$ may be used to test Equations
[35] and [36]. Differentiation of Equation [35] yields

$$\frac{\partial^2 z(x, y_i)}{\partial x^2} = \frac{d^2 f_1(w)}{dw^2} \qquad [38]$$

From the tables of second-order differences, corresponding values
$(w_1, x_1)$, $(w_2, x_2)$, $(w_3, x_3)$, etc., that satisfy Equation [38] may be
determined. Then, $f_2(dy_i)$ is again equal to

$$f_2(dy_i) = w_1 - x_1 = w_2 - x_2 = w_3 - x_3, \text{ etc.}$$

and $f_4(dy_i)$ is determined from Equation [35] with $(w, x)$ set equal
to $(w_1, x_1)$, $(w_2, x_2)$, $(w_3, x_3)$, etc. As before, the constancy of
$f_2(dy_i)$ and $f_4(dy_i)$ will determine whether Equations [35] and
[36] are a good or a poor approximation of $z(x, y_i)$.

An accurate graphical solution of Equation [37] may be made
by using the well-known "mirror method" of graphical differentiation.
To do so, one may determine the slope of $f_1(w)$ at a
chosen value of $w$, $w_1$. $x_1$ is the value of $x$ at which the slope of
$z(x, y_i)$ is equal to the slope of $f_1(w)$ at $w = w_1$. Other pairs of
corresponding values $(w_2, x_2)$, $(w_3, x_3)$, etc., may be found by the
same method. Then, as before,

$$f_2(dy_i) = w_1 - x_1 = w_2 - x_2 = w_3 - x_3, \text{ etc.}$$

and $f_3(dy_i)$ is found from Equation [33] for $(w, x)$ equal to $(w_1, x_1)$,
$(w_2, x_2)$, $(w_3, x_3)$, etc. There does not seem to be any simple and
accurate graphical method for determining corresponding values
of $w$ and $x$ that satisfy Equation [38].


AUTHOR'S CLOSURE


The comments of Mr. Sherrard are greatly appreciated. The 
methods which he describes add a considerable improvement in 
mathematical treatment to the graphical principles which have 
been described. 

In answer to his specific questions concerning the differences 
between the Meissinger method and that described here, the fol- 
lowing comparisons may be made: 


1 Both methods can generate functions which have constant 
spacing between isoclines. 

2 Functions which contain intersecting lines of y = const 
may be generated easily by the method shown in the paper. 
This is a more difficult case for the Meissinger method than that 
of nonintersecting curves having a laminated pattern. 

3 The method described here cannot generate functions in
which the curves of y = const are greatly dissimilar in basic
shape. The Meissinger method can generate some cases of this
nature using a three-channel diode network.


The basic Meissinger cases of convergent radiating isoclines
develop families of curves which expand outward from the convergent
point. These cannot be generated directly by the methods
which have been described here. However, by substitution
of multiplications for the summations shown in Fig. 5, functions
of this nature can be represented.

The other methods for generating functions of two variables 
which Mr. Sherrard has mentioned are certainly more general and 
will provide more uniform accuracy than that described here. 
As he points out, "map readers" of this nature are also much
more expensive, and involve considerably more complexity of
equipment.


We have also used a map-reading technique of this nature for 
simulating difficult functions. The method generates a family 
of arbitrary curves, but requires a function generator for each 
curve, a multiplier to interpolate between each pair of lines, and 
three amplifiers per curve. Although this is effective for an 
occasional three-dimensional function, such a technique becomes 
prohibitive if many functions must be simulated in a given prob- 
lem. The method described in the paper has been presented as a 
useful procedure where simplicity and economy of equipment are 
desired. 




Basic Methods for Nonlinear Control-System Analysis


By T. M. STOUT,1 LOS ANGELES, CALIF.


The important phase-plane and describing-function
methods are explained with the help of several simple
examples. It is shown that the phase-plane method is
best adapted to transient analysis of second-order systems
and that the describing-function method is intended primarily
for stability analysis of higher-order systems.
Details and extensions of both methods are discussed in
two appendixes, and a number of less important methods
are mentioned.


NUMEROUS methods which could be used to analyze nonlinear
control systems are described in the literature on
nonlinear mechanics and numerical analysis (see, for
example, references 1-7);2 not all of these methods, however,
have been applied to control problems. In addition, control
engineers have invented new methods or modified existing methods
to fit their needs. The control-system analyst should be
familiar with as many methods as possible, since the various
methods cannot be used interchangeably and some are better
suited to particular kinds of problems than others.

Some methods are inherently approximation methods, while
others are capable of arbitrary refinement. Some methods work
best (or at all) for systems which are only slightly nonlinear, while
others can accommodate any degree of nonlinearity. Some
methods give the response for a particular input or set of initial
conditions, others indicate whether a system is stable or not,
others give the amplitude and frequency of a sustained oscillation
if one occurs, and still others give a birds-eye view of over-all system
behavior. Some methods are adaptations or extensions of
methods used for linear systems, while others are completely different.

In a short paper, we cannot list and explain all available
methods. The paper therefore emphasizes the two methods
which are most widely used in control-system analysis, the phase-plane
method and the describing-function method. By means of
some simple examples, we will attempt to show how the methods
are used, what sort of information they give, and their advantages
and limitations. Additional information concerning these two
methods is given in two appendixes. Several less important
methods also will be mentioned.


PHASE-PLANE METHOD


Phase-plane analysis of dynamic systems goes back at least 50
years to work of physicists and astronomers. Extension of these
techniques to control-system problems is, of course, more recent.
Some of the earliest applications can be found in work of Doll (8),
Hurewicz (9), MacColl (10), and Weiss (11).


1 Ramo-Wooldridge Corporation; this paper was written while the
author was employed by the Schlumberger Instrument Company,
Ridgefield, Conn.

2 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at the
ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
6, 1956. Paper No. 56-IRD-9.

The phase plane, whose co-ordinates are usually the system 
error and error rate, provides both a means for representing phe- 
nomena in nonlinear systems and a means for detailed analysis of 
such systems. If the system is described by a second-order dif- 
ferential equation and if the input is a step or ramp function, the 
state of the system at any time is completely specified by a point 
in the phase plane. A succession of such points, joined by a continuous
curve to show the history of the system, is called a
"trajectory."

In the examples which follow, we assume that the trajectories 
are known, either by experiment or calculation. We adopt this 
approach to focus attention on use of the phase plane and the 
information that can be obtained from it. The real value of the 
phase-plane method lies, of course, in the fact that trajectories 
showing system behavior can be constructed even when the vari- 
ables cannot be found directly as functions of time. A number of 
methods for constructing trajectories are explained in Appendix 1. 


Examples 


The following examples all concern a positioning servomecha- 
nism with an output member characterized by inertia (J) and 
viscous friction (f). The torque used to position the output mem- 
ber is, in general, a nonlinear function of the error between the de- 
sired and actual positions of the output shaft. 

1 Linear System. Any method for analyzing nonlinear sys- 
tems also must work for linear systems, although the converse is 
unhappily not true. We begin by considering a system in which 
the torque is proportional to the error. The torque-error relation 
is expressed in this case by the single equation 


$T = Ke$     [1]


The system trajectories have the possible forms shown in Fig. 1. 
We suppose that the system has been subjected to a step input 
and starts at point A. If the system damping (proportional to f) 
is small, the error reaches zero in an oscillatory manner, shown by 
the spiral trajectories of Fig. 1(A). If the system damping is 
large, the error decreases to zero without overshoot, as shown in 
Fig. 1(B). If the system damping is zero, continuous oscillations 
occur; the system returns to its starting point on a closed ellipti- 
cal trajectory, as shown in Fig. 1(C). 

Since a positive error rate always corresponds to an increasing 
error and a negative error rate to a decreasing error, the arrows 
along the trajectories always point to the right in the upper half 
of the plane and to the left in the lower half. 

This system is stable for all parameter values (except f < 0), 
since all trajectories end at the origin, and the steady-state error 
for step inputs is zero. Because the system is linear, trajectories 
for other step magnitudes can be drawn by a simple scaling proc- 
ess, expanding or contracting the given trajectories to pass 
through the specified initial point. 
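The three cases of Fig. 1 can be reproduced numerically. The following is a minimal sketch and not part of the original analysis: it assumes a step input so that the error obeys J ë + f ė + K e = 0, uses a simple midpoint integration step, and the function names and parameter values are chosen only for illustration.

```python
def linear_trajectory(J, f, K, e0, t_end=20.0, dt=0.001):
    """Phase-plane points (e, e_dot) for J*e'' + f*e' + K*e = 0, starting at (e0, 0)."""
    def accel(e, v):
        return -(f * v + K * e) / J

    e, v = e0, 0.0
    points = [(e, v)]
    for _ in range(int(round(t_end / dt))):
        # Midpoint (second-order) step; an assumed integrator, not the paper's.
        e_mid = e + 0.5 * dt * v
        v_mid = v + 0.5 * dt * accel(e, v)
        e, v = e + dt * v_mid, v + dt * accel(e_mid, v_mid)
        points.append((e, v))
    return points


def overshoot(points):
    """Largest excursion of the error below zero, assuming a positive initial error."""
    return max(0.0, -min(e for e, _ in points))


# Illustrative values only: J = K = 1 with small, large, and zero damping.
small_damping = linear_trajectory(J=1.0, f=0.4, K=1.0, e0=1.0)
large_damping = linear_trajectory(J=1.0, f=3.0, K=1.0, e0=1.0)
no_damping = linear_trajectory(J=1.0, f=0.0, K=1.0, e0=1.0)

print("overshoot, small damping:", round(overshoot(small_damping), 3))
print("overshoot, large damping:", round(overshoot(large_damping), 3))
print("late amplitude, no damping:",
      round(max(abs(e) for e, _ in no_damping[-7000:]), 3))
```

Plotting the (e, ė) pairs of each list gives the spiral, the non-oscillatory approach, and the closed curve described above.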

Fig. 1  Phase-Plane Representation of Linear-System Behavior: (A) Small Damping; (B) Large Damping; (C) No Damping

Fig. 2  (A) Torque-Error Relation in Saturating Linear System; (B) Phase-Plane Representation of Step Response

Fig. 3  Phase-Plane Representation of Relay System

Fig. 4  Phase-Plane Representation of Relay System With Time Delay

2 Saturating Linear System. The system considered in the
first example becomes nonlinear if the torque becomes constant
($\pm T_m$) whenever the absolute error exceeds a certain value ($e_s$),
as indicated in Fig. 2(A). The torque-error relation is now given
by three equations

$T = +T_m$,   $e > e_s$
$T = Ke$,   $-e_s \le e \le e_s$     [2]
$T = -T_m$,   $e < -e_s$

The phase plane likewise may be divided into three regions, a
linear region bounded by the lines $e = \pm e_s$ and two saturated
regions. In the linear region, the trajectories have the possible
shapes of Fig. 1, the exact shape depending on the parameters of
the system. In the saturated regions, the trajectories approach
a limiting error rate, $\dot{e} = \pm T_m/f$.

A typical trajectory, obtained by fitting together trajectories
appropriate to the respective regions, is shown in Fig. 2(B) for the
case $T_m = J = f = e_s = 1$. The system is again stable and all
trajectories will end at the origin. Since the trajectories cannot
enter the linear region with an error rate greater than the limiting
rate, the overshoot cannot be much greater than that shown,
about one third of a unit for a three-unit step input.
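The fitting together of saturated and linear trajectories can be imitated by direct integration. This is a sketch under stated assumptions: the torque curve is taken to be continuous (so that K = T_m/e_s = 1 for the quoted case), the integrator is a midpoint rule, and the function names are ours.

```python
def saturating_torque(e, K=1.0, T_m=1.0):
    """Linear torque K*e clipped at +/-T_m (so e_s = T_m / K)."""
    return max(-T_m, min(T_m, K * e))


def step_response(e0, J=1.0, f=1.0, t_end=30.0, dt=0.001):
    """Integrate J*e'' + f*e' + T(e) = 0 from (e0, 0); returns (e, e_dot) points."""
    def accel(e, v):
        return -(f * v + saturating_torque(e)) / J

    e, v = e0, 0.0
    points = [(e, v)]
    for _ in range(int(round(t_end / dt))):
        e_mid = e + 0.5 * dt * v
        v_mid = v + 0.5 * dt * accel(e, v)
        e, v = e + dt * v_mid, v + dt * accel(e_mid, v_mid)
        points.append((e, v))
    return points


points = step_response(e0=3.0)                 # three-unit step, T_m = J = f = e_s = 1
peak_rate = max(abs(v) for _, v in points)     # bounded by the limiting rate T_m / f
overshoot = -min(e for e, _ in points)
print("peak error rate:", round(peak_rate, 3))
print("overshoot:", round(overshoot, 3))
```

The peak error rate stays below T_m/f, and the overshoot comes out at roughly the one-third unit read from the figure.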

3 Relay System. If $e_s$ in the previous system is made smaller
and smaller by increasing K, the system becomes effectively a relay
system in which the torque-error relation is

$T = +T_m$,   $e > 0$
$T = -T_m$,   $e < 0$     [3]



The phase plane is now divided into two parts with relay opera- 
tions occurring on the dividing line between the two regions, 
e = 0. The trajectory following a four-unit step is shown in Fig. 
3. The system is again stable and the maximum possible over- 
shoot is about one-third unit. 
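The bound on the overshoot can be checked by solving the linear equation that applies after the relay reverses. The sketch below is our own derivation, not taken from the paper: it assumes the trajectory crosses e = 0 at a speed v0 no greater than the limiting rate T_m/f, and integrates the decelerating motion until the error rate reaches zero.

```python
import math


def relay_overshoot(v0, J=1.0, f=1.0, T_m=1.0):
    """Distance travelled past e = 0 when the error crosses zero at speed v0.

    After the reversal the speed u obeys J*u' = -f*u - T_m with u(0) = v0;
    integrating u until it reaches zero gives the expression returned here.
    """
    v_lim = T_m / f                       # limiting error rate
    return (J / f) * (v0 - v_lim * math.log(1.0 + v0 / v_lim))


# Worst case for T_m = J = f = 1: crossing at the limiting rate, v0 = 1.
print(round(relay_overshoot(1.0), 3))     # 1 - ln 2, about 0.307
```

For large steps the crossing speed approaches the limiting rate, so the overshoot approaches 1 - ln 2 in these units.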

4 Relay System With Time Delay. We suppose as before that
the relay starts to operate when the error is zero, but that torque
reversal actually occurs $T_d$ time units later. Since the distance
traveled in a fixed time is proportional to the velocity, the effective
switching line is

$e - T_d \dot{e} = 0$     [4]


As shown in Fig. 4, switching is initiated at point B but torque re- 
versal does not occur until the trajectory reaches point C. The 
system is now unstable with all trajectories ending in a closed 
curve or “limit cycle.” The amplitude of the limit cycle in this
case is about one-third unit; the period of the oscillation is not 
available directly but turns out to be about 3.7 time units. 
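A limit cycle of this kind is easy to observe by simulation. The sketch below is illustrative only: the delay value (0.25 time unit), the unit parameters, the Euler integration step, and all function names are assumptions, since the delay used for Fig. 4 is not stated here.

```python
def simulate_delayed_relay(e0, delay, J=1.0, f=1.0, T_m=1.0, t_end=40.0, dt=0.001):
    """Error history for J*e'' + f*e' + T = 0, where T follows sign(e) after `delay`."""
    n_delay = int(round(delay / dt))
    e_hist = [e0]
    v = 0.0
    for k in range(int(round(t_end / dt))):
        e = e_hist[-1]
        e_seen = e_hist[max(0, k - n_delay)]      # error that caused the present torque
        torque = T_m if e_seen > 0.0 else -T_m
        v += dt * (-(f * v + torque) / J)
        e_hist.append(e + dt * v)
    return e_hist


def amplitude_and_period(e_hist, dt=0.001, t_window=10.0):
    """Peak error and mean spacing of upward zero crossings over the final window."""
    tail = e_hist[-int(round(t_window / dt)):]
    amplitude = max(abs(e) for e in tail)
    ups = [i for i in range(1, len(tail)) if tail[i - 1] <= 0.0 < tail[i]]
    period = (ups[-1] - ups[0]) * dt / (len(ups) - 1)
    return amplitude, period


amp_large, period_large = amplitude_and_period(simulate_delayed_relay(3.0, 0.25))
amp_small, period_small = amplitude_and_period(simulate_delayed_relay(0.01, 0.25))
print("from a large step:", round(amp_large, 2), round(period_large, 2))
print("from a small disturbance:", round(amp_small, 2), round(period_small, 2))
```

Both starting points settle on the same closed curve, which is the sense in which all trajectories end in the limit cycle.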

5 Relay System With Dead Zone. In this case, we suppose that 
a finite error signal is needed to operate the relay but that relay 
operation is instantaneous. The torque-error relation is therefore 


$T = +T_m$,   $e > e_d$
$T = 0$,   $-e_d \le e \le e_d$     [5]
$T = -T_m$,   $e < -e_d$

The phase plane is again divided into three regions. In the
regions corresponding to $+T_m$ and $-T_m$, the trajectories are similar
to those of the saturating linear system or relay system (Examples
2 and 3). In the center zone where no torque is applied
to the output member, the error rate decreases because of friction
losses; the trajectories in this region are straight lines with a
slope $-f/J$.

A typical trajectory is plotted in Fig. 5 for the case $e_d = 0.25$;
the overshoot is about 0.4 unit and the final error about 0.125
unit. This system is stable in the sense that the error rate eventually
reaches zero; there is, however, a steady-state position
error less than or equal to $e_d$.
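The residual error can be seen in a short simulation. This is a sketch under assumptions of our own: unit values of T_m, J, and f, a three-unit step (the step used for Fig. 5 is not stated here), and a simple Euler integration.

```python
def dead_zone_relay(e, e_d=0.25, T_m=1.0):
    """Zero torque for |e| <= e_d, otherwise +/-T_m."""
    if e > e_d:
        return T_m
    if e < -e_d:
        return -T_m
    return 0.0


def step_response(e0, J=1.0, f=1.0, t_end=30.0, dt=0.001):
    """Integrate J*e'' + f*e' + T(e) = 0 from (e0, 0); returns (e, e_dot) points."""
    e, v = e0, 0.0
    points = [(e, v)]
    for _ in range(int(round(t_end / dt))):
        v += dt * (-(f * v + dead_zone_relay(e)) / J)
        e += dt * v
        points.append((e, v))
    return points


points = step_response(e0=3.0)          # hypothetical three-unit step
e_final, v_final = points[-1]
overshoot = -min(e for e, _ in points)
print("overshoot:", round(overshoot, 3), "final error:", round(e_final, 3))
```

The output member coasts to rest inside the dead zone, so the final error is nonzero but no larger than e_d.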


Fig. 5  Phase-Plane Representation of Relay System With Dead Zone


Comments 


The phase-plane method is suitable for transient analysis of 
systems described by a single nonlinear differential equation of 
second order or by a succession of linear or nonlinear differential 
equations, each applicable in certain ranges of the variables. 
Several nonlinearities can be considered simultaneously. A 
phase-plane analysis can be made as accurate as desired and shows 
the general nature of the system behavior. In addition, the phase-plane
plots give a direct indication of the degree of damping or
amount of overshoot and the amplitude of any sustained oscillations
which may occur.

The times associated with each point on a trajectory, needed 
to find the response time for step inputs or the period of any sus- 
tained oscillations, are not directly available; they can, however, 
be computed from the relation 


$t_{AB} = \int_{A}^{B} \frac{1}{\dot{e}}\, de$     [6]


or by a graphical construction due to Diprose (13). 

A unique relation between the system trajectories and the 
phase-plane co-ordinates exists only for second-order systems sub- 
jected to step or ramp inputs. The behavior of a second-order 
system with more general inputs can be computed by generaliza- 
tions of the basic phase-plane techniques and recorded in the 
error-error rate plane. For higher-order systems, a phase space 
is required; projections of phase-space trajectories into the error- 
error rate plane may still be useful (12, 14, 36). 


DESCRIBING-FUNCTION METHOD


The describing-function method rests on the work of Fourier 
and parallels techniques used for some time in the field of non- 
linear mechanics. As a method for analysis of nonlinear control 
systems, it seems to have been developed independently and al- 
most simultaneously by Tustin in England (15), Goldfarb in 
Russia (16), Oppelt in Germany (17), Dutilh in France (18), and 
Kochenburger in the United States (19). Numerous extensions 
and applications of the basic idea have been made in the past few 
years. 

Fig. 6  Describing Function for Saturating Linear Element

Fig. 7  Block Diagram of Control System With a Nonlinear Element

The describing function for a nonlinear element is found by applying
a sinusoidal input to the element and, using standard
Fourier analysis techniques, determining the fundamental com- 
ponent of the distorted output. The describing function has two 
parts: (a) The output-input amplitude ratio, computed from the 
peak values of the fundamental output component and the sinu- 
soidal input; and (b) the phase shift between the sinusoidal input 
and the fundamental output component.³ For elements which
are completely described by a curve relating the instantaneous 
input and output, the describing function depends on the magni- 
tude of the input but is independent of its frequency. 

The describing function for a saturating linear element is shown 
in Fig. 6. For inputs less than $x_s$, the describing function is K,
the slope of the output-input curve. For larger inputs, the output 
magnitude is limited and the describing function decreases. 
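The Fourier calculation is readily carried out numerically. The sketch below assumes a single-valued, odd nonlinearity (so that the phase shift is zero and only the sine component is needed); the closed-form expression for the saturating element is the standard textbook result rather than a formula quoted in this paper, and the function names are ours.

```python
import math


def describing_function(nonlinearity, X, n=20000):
    """Ratio of the fundamental output amplitude to the input amplitude X.

    The output for the input X*sin(theta) is integrated against sin(theta)
    over one cycle by the midpoint rule.
    """
    d_theta = 2.0 * math.pi / n
    b1 = 0.0
    for k in range(n):
        theta = (k + 0.5) * d_theta
        b1 += nonlinearity(X * math.sin(theta)) * math.sin(theta) * d_theta
    return b1 / (math.pi * X)


def saturation(x, K=1.0, x_s=1.0):
    """Output of the saturating linear element: slope K, limited to +/-K*x_s."""
    return max(-K * x_s, min(K * x_s, K * x))


def saturation_df(X, K=1.0, x_s=1.0):
    """Standard closed form for the saturating element (textbook result)."""
    if X <= x_s:
        return K
    r = x_s / X
    return K * (2.0 / math.pi) * (math.asin(r) + r * math.sqrt(1.0 - r * r))


for X in (0.5, 1.5, 2.0, 5.0):
    print(X, round(describing_function(saturation, X), 4), round(saturation_df(X), 4))
```

The two columns agree: the value is K up to the saturation level and falls off for larger inputs, as in Fig. 6.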

Use of describing functions is based on the block diagram shown
in Fig. 7. For checking stability, we can assume the input (r)
is zero. The condition for the existence of a sustained oscillation
is

$e = -c$     [7]

which requires that

$G_1 G_D G_2 = -1$     [8]

or

$G G_D = -1$     [9]

if we let $G = G_1 G_2$. In these equations, $G_1$ and $G_2$ represent
ordinary linear gain functions which are independent of amplitude
but dependent on frequency, and $G_D$ is the describing function for
the nonlinear element. In our examples, G is given by

$G(\omega) = \frac{1}{j\omega(Jj\omega + f)}$     [10]

or, with J = f = 1, by

$G(\omega) = \frac{1}{j\omega(j\omega + 1)}$     [11]

and $G_D$ represents the nonlinear torque-error relation.

To check for the existence of sustained oscillations, we will
write Equation [9] in the equivalent form

$G = -\frac{1}{G_D}$     [12]

We then plot G and $-1/G_D$ as separate curves in a standard polar
plot showing the magnitude and phase of each quantity. Intersections
of the two curves, if any exist, give the approximate
magnitude and frequency of possible sustained oscillations.
Further analysis is required to determine whether the intersections
are points of convergent or divergent equilibrium (19).

3 For the examples used in this paper, the phase shift is zero.


Examples


1 Linear System. If there is no saturation, the torque-error
relation is simply $G_D = K$. The curve $-1/G_D$ becomes the point
$-1/K$. As shown in Fig. 8, $G(\omega)$ is infinite and has a phase angle
of -90 deg at zero frequency, and approaches zero with a phase
angle of -180 deg at infinite frequency. As is the case with the
usual Nyquist criterion, the system is stable since the curve
$G(\omega)$ does not enclose the critical point $-1/K$, regardless of the
value of K.

2 Saturating Linear System. Fig. 6 indicates that saturation
causes $G_D$ to become less than K when the input to the saturating
element is large. Thus $1/G_D$ is always equal to or greater than
1/K, and $-1/G_D$ occupies the entire negative axis of the polar
plot to the left of the point $-1/K$, as shown in Fig. 8. The system
is stable under the conditions shown. It could exhibit sustained
oscillations if K were increased and if additional phase lags,
indicated by the dashed line, were introduced.

Fig. 8  Polar Plot for Linear System and Saturating Linear System

Fig. 9  Describing Function for Ideal Relay

Fig. 10  Polar Plot for System With Ideal Relay

Fig. 11  Polar Plot for System With Relay and Time Delay

Fig. 12  Describing Function for Relay With Dead Zone


3 Relay System. The calculation of the describing function
for an ideal relay is based on Fig. 9(A). Since the relay is assumed
to have no dead zone, the relay output is a square wave for
a sinusoidal input of even infinitesimal amplitude. Since the
fundamental component of a square wave is $(4/\pi)$(peak value),
$G_D$ is a constant divided by $X_i$, giving the hyperbola plotted in
Fig. 9(B). Since $G_D$ now ranges from zero to infinity, $-1/G_D$
occupies the entire negative axis of the polar plot, as shown in Fig.
10. For the $G(\omega)$ considered in these examples, the system is
stable; with any slight additional phase shift, sustained oscillations
would take place.

4 Relay System With Time Delay. Addition of time delay to
the relay system can be represented by adding a phase shift proportional
to frequency to $G(\omega)$, giving the curve shown in Fig. 11.
The curves $G(\omega)$ and $-1/G_D$ intersect at point A, indicating the
possibility of a sustained oscillation. In terms of the usual
Nyquist criterion, all points on $-1/G_D$ between A and the
origin are enclosed by the curve $G(\omega)$, so that the system is unstable
in some sense for the small $X_i$ amplitudes associated with
$G_D$ in this range. In similar fashion, the system is stable for points
on $-1/G_D$ to the left of A which are not enclosed and which
correspond to large $X_i$ amplitudes. Point A is therefore a convergent
equilibrium point, since larger amplitudes decay to A and
smaller ones grow to A. Following any slight disturbance, this
system will therefore oscillate with the approximate frequency
and amplitude associated with point A.
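The location of point A can also be found without plotting. The sketch below is an illustration under assumptions of our own: the linear part is taken as 1/[jω(Jjω + f)] with a pure delay, the ideal-relay describing function is taken as 4T_m/(πX), and the delay value of 0.25 time unit is hypothetical.

```python
import math


def predicted_limit_cycle(delay, J=1.0, f=1.0, T_m=1.0):
    """Describing-function estimate (frequency, amplitude) for the relay with delay.

    G = -1/G_D requires the phase of G(w) to be -180 deg and
    |G(w)| = pi*X / (4*T_m) at the same frequency.
    """
    def phase_excess(w):
        # Total phase lag of G(w) minus pi; zero where G crosses the negative real axis.
        return math.pi / 2 + math.atan(J * w / f) + w * delay - math.pi

    lo, hi = 1e-9, 1e3
    for _ in range(200):                      # bisection; the lag increases with w
        mid = 0.5 * (lo + hi)
        if phase_excess(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    w = 0.5 * (lo + hi)
    gain = 1.0 / (w * math.hypot(J * w, f))   # |G(w)|
    X = 4.0 * T_m * gain / math.pi
    return w, X


w, X = predicted_limit_cycle(delay=0.25)      # assumed delay value
print("period:", round(2.0 * math.pi / w, 2), "amplitude:", round(X, 2))
```

A shorter delay moves the intersection toward the origin, giving a faster oscillation of smaller amplitude.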

Fig. 13  Polar Plot for System With Relay Having Dead Zone

5 Relay System With Dead Zone. If the relay has a dead zone,
its output is obviously zero for small inputs. The describing function
$G_D$ is therefore zero for small inputs, increases rapidly for inputs
slightly in excess of the dead zone, reaches a maximum, and
then returns to zero, as shown in Fig. 12. The polar plot of Fig.
13 indicates that the system is stable and would, in fact, remain
stable with the introduction of additional phase lags (indicated
by the dashed line). With a smaller dead zone, however, the
maximum value of $G_D$ increases, $-1/G_D$ occupies a larger part of
the negative axis and comes closer to the origin, thus creating the
possibility of an intersection of $-1/G_D$ with the $G(\omega)$ curve
shown by the dashed line.

In this particular case, any intersection of $-1/G_D$ with $G(\omega)$
is, in fact, two intersections. The intersection with the upper
branch of $-1/G_D$ is a convergent equilibrium point (for reasons
discussed in connection with Example 4) corresponding to a
large amplitude oscillation. The other intersection, on the lower
branch of $-1/G_D$, is a divergent equilibrium point and represents
a sort of dividing line between disturbances which decay back
inside the dead zone and those which build up into a stable oscillation.
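The effect of the dead-zone width on the maximum of the describing function can be made explicit. The expression used below is the standard textbook result for a relay with dead zone and no hysteresis; it is not quoted from this paper, and the symbols e_d and T_m follow the notation assumed in the phase-plane examples.

```python
import math


def dead_zone_relay_df(X, e_d, T_m=1.0):
    """Describing function of a relay with dead zone e_d and output +/-T_m."""
    if X <= e_d:
        return 0.0
    return (4.0 * T_m / (math.pi * X)) * math.sqrt(1.0 - (e_d / X) ** 2)


def peak_df(e_d, T_m=1.0):
    """Maximum of the describing function, reached at X = sqrt(2) * e_d."""
    return 2.0 * T_m / (math.pi * e_d)


for e_d in (0.25, 0.1):
    grid = [e_d * (1.0 + 0.001 * k) for k in range(1, 5000)]
    print(e_d, round(max(dead_zone_relay_df(X, e_d) for X in grid), 3),
          round(peak_df(e_d), 3))
```

Since the curve $-1/G_D$ comes no closer to the origin than the reciprocal of this maximum, halving the dead zone halves that distance. Each value of the describing function below the maximum occurs at two amplitudes, which is why an intersection appears as a pair.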


Comments 


The describing-function method gives a direct indication of the
existence, amplitude, and frequency of possible sustained oscillations.
In general, the errors in the predicted amplitude and frequency
are not more than ten per cent (19). This accuracy is
probably consistent with the original data and adequate for most
engineering purposes. On the other hand, the describing-function
method is inherently an approximate method and may therefore
fail occasionally to predict a sustained oscillation which actually
does occur or may predict one which does not occur.

The proximity of the $-1/G_D$ and $G(\omega)$ curves gives a rough
idea of the relative stability of the system and thus provides a 
very qualitative indication of the transient response to be ex- 
pected. The curves also can be used to construct approximate 
frequency-response curves for the system; unlike the linear case, 
the output-input ratio is a function of amplitude as well as fre- 
quency. 

In contrast with the phase-plane method, the describing-func- 
tion method is applicable to systems in which the linear elements 
are described by differential equations of third or higher order. 
The effects of various compensation methods (which increase the 
order of the differential equations) can be examined by making 
appropriate modifications of the $G(\omega)$ curve, just as in the case of
linear system design. 

Some of the assumptions underlying the describing-function 
method are discussed in Appendix 2. 


OTHER METHODS


A number of other methods, less widely used, are available for 
analysis of nonlinear control systems. Most of the methods men- 
tioned in the following paragraphs are useful for determining the 
response of systems to particular inputs. 

Numerical integration techniques, in principle, can always be
used if the differential equations can be written. Many of the
numerical methods concern a first-order equation

$\dot{x} = F(x, t)$     [13]

or systems of such equations, although methods also exist for dealing
directly with second-order equations. These methods are
essentially extrapolation and correction schemes. The simplest
procedure is linear extrapolation, using the relation

$x(t + \Delta t) = x(t) + \Delta t\, \dot{x}(t)$     [14]

The computed value of x can be used to obtain an average $\dot{x}$ in the
interval $\Delta t$, from which a more accurate value of x can be found.
Still better methods are described in texts on numerical analysis
(5 to 7). These methods are somewhat tedious for hand use but
are basic to automatic digital computers.
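The extrapolation and correction steps can be written out in a few lines. This is a minimal sketch: the corrected step shown is the scheme usually called Heun's method, which is one reading of the averaging procedure described above, and the test equation and function names are ours.

```python
import math


def euler_step(F, x, t, dt):
    """Linear extrapolation: x(t + dt) = x(t) + dt * F(x, t)."""
    return x + dt * F(x, t)


def corrected_step(F, x, t, dt):
    """Extrapolate, then repeat the step with the average of the two slopes."""
    x_pred = euler_step(F, x, t, dt)
    slope_avg = 0.5 * (F(x, t) + F(x_pred, t + dt))
    return x + dt * slope_avg


def integrate(step, F, x0, t_end, dt):
    """Apply `step` repeatedly from t = 0 to t_end and return the final x."""
    x, t = x0, 0.0
    for _ in range(int(round(t_end / dt))):
        x = step(F, x, t, dt)
        t += dt
    return x


def decay(x, t):
    """Test equation x' = -x, whose exact solution is exp(-t)."""
    return -x


x_euler = integrate(euler_step, decay, 1.0, 1.0, 0.1)
x_corr = integrate(corrected_step, decay, 1.0, 1.0, 0.1)
print("extrapolation only:", round(x_euler, 4))
print("with correction:  ", round(x_corr, 4))
print("exact:            ", round(math.exp(-1.0), 4))
```

With the same interval, the corrected value lies much closer to the exact solution than the simple extrapolation.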

Equivalent methods in which the integrations are performed 
graphically (instead of numerically) have been proposed by Hsia 
(20) and Paynter (21). 

A number of step-by-step methods combining numerical and 
graphical techniques have been proposed by Tustin (22), Madwed 
(23), and Stout (24). In these methods, the response of a linear 
element is computed from its input by numerical methods, mak- 
ing direct use of the superposition principle, and the nonlinearity 
is taken into account by a graphical construction. 

Another adaptation of the superposition principle was used by 
Kahn (25) in a study of relay servomechanisms. Since the system 
following the relay was linear, its output could be found by super- 
position of step responses corresponding to the application and 
removal of a fixed input by the relay. The variables entering ele- 
ments preceding the relay determine the times of relay operation 
but do not otherwise affect the system output. 

In certain systems, such as the saturating or relay systems used 
in the examples, operation is governed by a number of linear 
differential equations applicable in restricted ranges of the 
system variables. This property, known as “piece-wise linearity,”
can be used to good advantage in phase-plane analysis and also can 
be used to construct response curves directly as functions of 
time. The response for any input is found by joining solutions of
the linear equations at transitions from one mode of operation to
another, the final conditions for one mode becoming the initial 
conditions for the next. Hazen (26) investigated the conditions 
for sustained oscillation in relay servomechanisms using this 
method, and Nichols (27) recently used the same approach to 
check the accuracy of a describing-function analysis. 


CONCLUSIONS 


The phase-plane and describing-function methods can furnish 
answers to many (but not all) of the performance questions which 
concern the control-system designer. The two methods comple- 
ment each other, the phase-plane method being useful for second- 
order systems and the describing-function method for higher-order 
systems. The phase-plane method deals primarily with 
transient conditions, however, while the describing-function 
method is concerned with steady-state oscillatory conditions. 
Extensions of both methods have been and will be devised to 
remedy these limitations and to increase their usefulness. 

A number of other methods are available for nonlinear con- 
trol-system analysis. Unlike the phase-plane and describing- 
function methods these methods reveal general features of sys- 
tem behavior only after exhaustive investigation of all kinds and 
sizes of system inputs. These methods are therefore better 
adapted to analysis of specific systems than to system design or 
modification. 


BIBLIOGRAPHY 


1 “Introduction to Nonlinear Mechanics,” by N. Minorsky,
J. W. Edwards Bros., Inc., Ann Arbor, Mich., 1947.

2 “Theory of Oscillations,” by A. A. Andronow and C. E.
Chaikin, Princeton University Press, Princeton, N. J., 1949.

3 “Nonlinear Vibrations,” by J. J. Stoker, Interscience Publishers,
Inc., New York, N. Y., 1950.

4 “Ordinary Nonlinear Differential Equations in Engineering
and Physical Sciences,” by N. W. McLachlan, Clarendon Press, Oxford,
England, 1950.

5 “Numerical Mathematical Analysis,” by J. B. Scarborough,
Johns Hopkins Press, Baltimore, Md., second edition, 1950.

6 “Numerical Solution of Differential Equations,” by W. E.
Milne, John Wiley & Sons, Inc., New York, N. Y., 1953.

7 “Numerische Behandlung von Differentialgleichungen,” by L.
Collatz, Julius Springer, Berlin, Germany, 1951.

8 “Automatic Control System for Vehicles,” by H. G. Doll, U. S.
Patent 2,463,362.

9 “Servos With Torque Saturation—Part II,” by W. Hurewicz,
Massachusetts Institute of Technology, Cambridge, Mass., Radio
Laboratory Report 592, September 28, 1944.

10 “Fundamental Theory of Servomechanisms,” by L. A. MacColl,
D. Van Nostrand Company, Inc., New York, N. Y., 1945.

11 “Analysis of Relay Servomechanisms,” by H. K. Weiss, Journal
of the Aeronautical Sciences, vol. 13, July, 1946, pp. 364-376.

12 “Design and Analog Computer Analysis of an Optimum Third-Order
Nonlinear Servomechanism,” by H. G. Doll and T. M. Stout,
published in this issue, pp. 513-525.

13 Discussion, K. V. Diprose, “Automatic and Manual Control,”
Academic Press, New York, N. Y., 1952, p. 304.

14 “A Method for Solving Third and Higher-Order Nonlinear
Differential Equations,” by Y. H. Ku, Journal of The Franklin Institute,
vol. 256, September, 1953, pp. 229-243.

15 “The Effects of Backlash and Speed Dependent Friction on the
Stability of Closed-Cycle Control Systems,” by A. Tustin, Journal of
The Institution of Electrical Engineers, vol. 94, part IIA, May, 1947, pp.
143-151.

16 “On Some Nonlinear Phenomena in Regulatory Systems,” by
L. C. Goldfarb, Avtomatika i Telemekhanika, vol. 8, 1947, pp. 349-383.
Translation in National Bureau of Standards Report 1691, May 29,
1952.

17 “Locus Methods for Regulators with Friction,” by W. Oppelt,
Zeitschrift VDI, vol. 90, June, 1948, pp. 179-183. Translation in
National Bureau of Standards Report 1691, May 29, 1952.

18 “Theory of Relay Servomechanisms,” by J. R. Dutilh, Onde
Electrique, vol. 30, October, 1950, pp. 438-445.

19 “A Frequency Response Method for Analyzing and Synthesizing
Contactor Servomechanisms,” by R. J. Kochenburger, Trans.
AIEE, vol. 69, part 1, 1950, pp. 270-284.

20 “A Graphical Analysis for Nonlinear Systems,” by P. S. Hsia,
Proceedings of The Institution of Electrical Engineers, vol. 99, part II,
April, 1952, pp. 125-134.

21 “How to Analyze Control Systems Graphically,” by H. M.
Paynter, Control Engineering, vol. 2, February, 1955, pp. 30-35,
March, pp. 72-78.

22 “A Method of Analyzing the Effect of Certain Kinds of Nonlinearity
in Closed-Cycle Control Systems,” by A. Tustin, Journal
of The Institution of Electrical Engineers, vol. 94, part IIA, May, 1947,
pp. 152-160.

23 “Number Series Method of Solving Linear and Nonlinear Differential
Equations,” by A. Madwed, Massachusetts Institute of
Technology, Cambridge, Mass., Instrumentation Laboratory Report
6445-T-26, April, 1950.

24 “A Step-by-Step Method for Transient Analysis of Feedback
Systems With One Nonlinear Element,” by T. M. Stout, AIEE Technical
Paper No. 56—779.

25 “An Analysis of Relay Servomechanisms,” by D. A. Kahn,
Trans. AIEE, vol. 68, part 2, Applications and Industry, 1949, pp.
1079-1088.

26 “Theory of Servomechanisms,” by H. L. Hazen, Journal of
The Franklin Institute, vol. 218, September, 1934, pp. 279-330.

27 “Backlash in a Velocity Lag Servomechanism,” by N. B.
Nichols, Trans. AIEE, vol. 73, part 2, Applications and Industry,
1954, pp. 462-467.

28 “Analysis of Nonlinear Servos by Phase-Plane-Delta Method,”
by R. N. Buland, Journal of The Franklin Institute, vol. 257, January,
1954, pp. 37-48.

29 “Oscillation of a Third-Order Nonlinear Autonomous System,”
by L. L. Rauch, “Contributions to the Theory of Nonlinear Oscillations,”
Princeton University Press, Princeton, N. J., 1950, pp. 39-88.

30 “Analysis and Design Principles of Second- and Higher-Order
Saturating Servomechanisms,” by R. E. Kalman, AIEE Paper 55-551.

31 “Sinusoidal Analysis of Feedback Control Systems Containing
Nonlinear Elements,” by E. C. Johnson, Trans. AIEE, vol. 71, part
2, Applications and Industry, 1952, pp. 169-181.

32 “Dissymmetrical Servomechanisms,” by J. Loeb and J. D.
Lebel, Annales des Télécommunications, vol. 9, October, 1954, pp.
282-286.

33 “The Mechanism of Subharmonic Generation in a Feedback
System,” by J. C. West and J. L. Douce, The Institution of Electrical
Engineers Paper 1693, 1954.

34 “Nonlinear Control Systems with Random Inputs,” by R. C.
Booton, Transactions of the Institute of Radio Engineers, Professional
Group on Circuit Theory, vol. CT-1, March, 1954, pp. 9-18.

35 “Operating Modes of a Servomechanism with Nonlinear Friction,”
by H. Lauer, Journal of The Franklin Institute, vol. 255, June,
1953, pp. 497-511.

36 “Phase-Plane Analysis of Automatic Control Systems With
Nonlinear Gain Elements,” by R. E. Kalman, Trans. AIEE, vol. 73,
part 2, Applications and Industry, 1954, pp. 383-390.


Appendix 1 
PHASE-PLANE METHOD


Construction of Trajectories 


Direct Methods. Phase-plane trajectories for a second-order 
system can be obtained in a variety of ways. An x-y plotting
table or oscilloscope might be connected to the actual system or an 
analog computer to record trajectories directly. 

If the system is linear in some part of the plane, ordinary
analytical methods can be used to solve the appropriate differential
equations. Points representing values of error and error rate
at a particular time may then be plotted and joined by a continuous
curve. For example, the saturated region in the examples
is governed by the equation

$J\ddot{e} + f\dot{e} \pm T_m = 0$     [15]

which has the solutions

$e = e_0 + \frac{T_m}{f}\, t - \frac{J T_m}{f^2}\left[1 - \exp(-ft/J)\right]$     [16]

$\dot{e} = \frac{T_m}{f}\left[1 - \exp(-ft/J)\right]$     [17]

when the initial error rate is zero and the negative sign is used in
Equation [15]. In this case, it is comparatively simple to eliminate
the time variable t analytically; the result is

$e - e_0 = -\frac{J T_m}{f^2} \ln\left(1 - \frac{f\dot{e}}{T_m}\right) - \frac{J}{f}\,\dot{e}$     [18]

The trajectory obtained from Equation [18] by taking $e_0 = 0$,
and a similar trajectory obtained by using the positive sign in
Equation [15], can be shifted horizontally to pass through any
point in the phase plane.
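The elimination of time can be verified numerically. In the sketch below the time solution and the trajectory expression are our own working for J ë + f ė - T_m = 0 with zero initial error rate and initial error e0; the parameter values are arbitrary and the function names are hypothetical.

```python
import math


def saturated_solution(t, e0, J, f, T_m):
    """Error and error rate at time t for J*e'' + f*e' - T_m = 0, starting at rest."""
    decay = math.exp(-f * t / J)
    e_dot = (T_m / f) * (1.0 - decay)
    e = e0 + (T_m / f) * t - (J * T_m / f ** 2) * (1.0 - decay)
    return e, e_dot


def saturated_trajectory(e_dot, e0, J, f, T_m):
    """Error as a function of error rate, obtained by eliminating t."""
    return e0 - (J * T_m / f ** 2) * math.log(1.0 - f * e_dot / T_m) - (J / f) * e_dot


# Arbitrary illustrative parameters.
J, f, T_m, e0 = 2.0, 0.5, 1.5, -3.0
for t in (0.5, 1.0, 4.0):
    e, e_dot = saturated_solution(t, e0, J, f, T_m)
    print(t, round(e, 6), round(saturated_trajectory(e_dot, e0, J, f, T_m), 6))
```

The two columns agree at every time, and changing e0 only shifts the curve horizontally.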

Indirect Analytical Methods. If these were the only methods for
obtaining trajectories, the phase-plane would not be very valuable.
Fortunately, however, trajectories can be constructed when
the original differential equation cannot be solved directly for the
variables as functions of time. As a simple illustration of an
analytic method for doing this, consider the differential equation
applicable to the dead zone in Example 5

$J\ddot{e} + f\dot{e} = 0$     [19]

For convenience, we introduce the new variables

$x = e$     [20]

$y = \dot{e} = \frac{dx}{dt}$     [21]

Since the error acceleration is $\ddot{e} = dy/dt$, Equation [19] may be
written

$\frac{dy}{dt} = -\frac{f}{J}\, y$     [22]

Dividing Equation [22] by Equation [21], we obtain

$\frac{dy/dt}{dx/dt} = \frac{dy}{dx} = -\frac{f}{J}$     [23]

a first-order differential equation for the trajectories. Integration
of Equation [23] gives

$y = -\frac{f}{J}\, x + K_1$     [24]

where $K_1$ is a constant of integration to be evaluated from initial
conditions. With $K_1$ evaluated and the original notation restored,
Equation [24] becomes

$\dot{e} = -\frac{f}{J}(e - e_0) + \dot{e}_0$     [25]

an algebraic expression for the trajectories in the dead zone.

In general, the original differential equation is likely to be more
complicated than Equation [19] and may be written

$\ddot{e} + \Phi(e, \dot{e})\,\dot{e} + \Psi(e, \dot{e})\,e = 0$     [26]

The same process that produced Equation [23] now gives

$\frac{dy}{dx} = \frac{-\Phi(x, y)\,y - \Psi(x, y)\,x}{y}$     [27]

Although Equation [27] is a first-order differential equation in x
and y, it may not be solvable by analytic methods and graphical
methods may therefore become necessary.

Fig. 14  Trajectory Construction for Linear System by Isocline Method

Fig. 15  Basic Principle of Lienard Construction

Isocline Method. One graphical method for solving Equation
[27] involves the use of “isoclines” which are the loci of points of
constant slope dy/dx = C. In the linear region, the isoclines are
given by

$\frac{dy}{dx} = \frac{-fy - Kx}{Jy} = C$     [28]

or, if J = f = K = 1, by

$y = -\frac{x}{1 + C}$     [29]

Isoclines plotted from Equation [29] are shown in Fig. 14. To
construct a trajectory starting at some point A (say, x = 1, y =
0):

1 Draw a line through A with the indicated slope C (at this
point $C = \infty$, so the line is vertical).

2 Draw a line through A with the slope associated with the 
next isocline (in this case, C = 5). 

3 Draw a line bisecting the angle between the two lines 
drawn through A. 

The bisector thus drawn is assumed to approximate the trajec- 
tory between the two isoclines. The intersection of this bisector 
with the second isocline gives a new initial point at which the 
process can be repeated. A few examples of the process are shown 
in Fig. 14. 
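The three steps above can be sketched as a short program for the linear case J = f = K = 1. Several choices here are ours rather than the paper's: directions are handled as angles instead of slopes so that the vertical line at A causes no difficulty, the isoclines are spaced one degree apart, and the construction is stopped after half a turn so that the result can be compared with the known overshoot of this linear system.

```python
import math


def isocline_trajectory(x0, y0, alpha0, n_steps, d_alpha):
    """Bisector construction for e'' + e' + e = 0, with x = e and y = e_dot.

    alpha is the direction angle of the trajectory (slope C = tan(alpha)).
    The isocline for a given alpha is the line x*cos(alpha) + y*(sin(alpha) + cos(alpha)) = 0,
    which is y = -x / (1 + C) written without division.
    """
    x, y, alpha = x0, y0, alpha0
    points = [(x, y)]
    for _ in range(n_steps):
        alpha_next = alpha - d_alpha              # the trajectory turns clockwise
        beta = 0.5 * (alpha + alpha_next)         # bisector of the two directions
        dx, dy = math.cos(beta), math.sin(beta)
        nx = math.cos(alpha_next)
        ny = math.sin(alpha_next) + math.cos(alpha_next)
        s = -(x * nx + y * ny) / (dx * nx + dy * ny)   # distance to the next isocline
        x, y = x + s * dx, y + s * dy
        points.append((x, y))
        alpha = alpha_next
    return points


# Start at A = (1, 0), where the direction is vertical (pointing downward).
points = isocline_trajectory(1.0, 0.0, -math.pi / 2, 180, math.radians(1.0))
x_end, y_end = points[-1]
exact = -math.exp(-math.pi * 0.5 / math.sqrt(0.75))   # first overshoot, damping ratio 0.5
print("constructed:", round(x_end, 4), "exact:", round(exact, 4))
```

Coarser isocline spacing, as would be used by hand, gives a rougher curve of the same general shape.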




The accuracy of this construction is increased by using a large 
number of segments to approximate a trajectory. If only moder- 
ate accuracy is required it is possible to superimpose short line 
segments of the proper slope on each isocline (as shown in the 
fourth quadrant of Fig. 14) and to sketch the trajectories free- 
hand. 

Lienard’s Construction. Another graphical method, known as
Lienard’s construction (3), is available for differential equations
which can be put into the form

$\frac{dy}{dx} = \frac{-\Phi(y) - x}{y}$     [30]

Before any trajectories are constructed, the curve $x = -\Phi(y)$ is
plotted in the phase plane, as shown in Fig. 15. To construct a
trajectory starting at some point P:

1 Draw a line through P parallel to the x-axis, intersecting
the curve at point R.

2 Draw a line from R to S perpendicular to the x-axis.

3 Draw the line PS.

4 Draw a line through P perpendicular to PS.

The line drawn through P in step 4 is tangent to the trajectory,
since its slope is

$\frac{dy}{dx} = -\frac{PR}{RS}$, where $PR = PQ + QR = x + \Phi(y)$ and $RS = y$

In practice, steps 3 and 4 can be simplified by using a compass to
construct an arc through P with its center at S; a short segment
of this arc may be a good approximation of the trajectory if both
have the same curvature. Applications of this method can be
found in a paper by Lauer (35).
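The geometry of the four steps is easily checked with co-ordinates. In this sketch the function name and the linear choice of the curve are assumptions made only for illustration.

```python
def lienard_tangent(x, y, phi):
    """Tangent direction at P = (x, y) from Lienard's construction.

    R = (-phi(y), y) lies on the curve x = -phi(y); S = (-phi(y), 0) is the foot
    of the perpendicular from R to the x-axis; the tangent is perpendicular to PS.
    """
    s_x, s_y = -phi(y), 0.0
    ps = (s_x - x, s_y - y)               # vector from P to S
    tangent = (-ps[1], ps[0])             # PS turned through 90 deg: (y, -phi(y) - x)
    return ps, tangent


def phi(y):
    """Assumed linear curve, used only as an example."""
    return y


ps, tangent = lienard_tangent(1.0, 0.5, phi)
slope = tangent[1] / tangent[0]
print("slope from construction:", slope)
print("slope from the differential equation:", (-phi(0.5) - 1.0) / 0.5)
```

The tangent vector found this way is (y, -Φ(y) - x), so its slope agrees with the differential equation at every point where y is not zero.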


Time From Phase-Plane Plots 


Although arrows on the trajectories show the direction of in- 
creasing time, the time associated with any particular point on the 
trajectory is not immediately available. Additional work is 
necessary to find, for example, the response time for a step input 
or the period of a sustained oscillation from trajectories con- 
structed by any of the graphical methods. Two basic procedures 
are available for this purpose. 

From the basic definition

$$y = \frac{dx}{dt} \qquad [31]$$

it follows that

$$dt = \frac{1}{y}\,dx \qquad [32]$$

Fig. 16 Approximation of Trajectories by Circular Arcs for Determining Time Intervals


The time required to go from point A to point B on a trajectory
is therefore

$$t_{AB} = \int_{x_A}^{x_B} \frac{dx}{y} \approx \sum \frac{\Delta x}{y_{avg}} \qquad [33]$$

where $y_{avg}$ is the average velocity for the distance $\Delta x$. This sort
of calculation is somewhat inconvenient, especially if y goes
through zero between A and B. Equation [33] may be useful in a
qualitative way, however, for showing which of two trajectories
spends the least time between A and B.
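
As a minimal sketch of the summation in Equation [33], the Python fragment below adds up $\Delta x / y_{avg}$ along a sampled trajectory. The function name `transit_time` and the test trajectory (the circle x = cos t, y = -sin t, which is the phase-plane path of a unit-frequency linear oscillator) are assumptions made for this example; the true elapsed time between the chosen end points is 1.0.

```python
import math


def transit_time(xs, ys):
    """Approximate the time between the first and last sample as sum(dx / y_avg)."""
    total = 0.0
    for i in range(len(xs) - 1):
        y_avg = 0.5 * (ys[i] + ys[i + 1])  # average velocity over the segment
        total += (xs[i + 1] - xs[i]) / y_avg
    return total


# Sample the assumed test trajectory from t = 0.2 to t = 1.2 (y does not reach zero).
n = 200
ts = [0.2 + i / n for i in range(n + 1)]
xs = [math.cos(t) for t in ts]
ys = [-math.sin(t) for t in ts]

elapsed = transit_time(xs, ys)
```

The sum breaks down when a segment has $y_{avg}$ near zero, which is the inconvenience noted in the text.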

A more direct procedure, usable even when y goes to zero on the
segment of trajectory under consideration, has been suggested by
Diprose (13). His procedure is based on approximation of sections
of the trajectory by arcs of circles centered on the x-axis and
could be applied as shown in Fig. 16. The center $C_1$ of the first
circle is located by trial and error so that arc AB closely
approximates segment AB of the trajectory. The time interval between
A and B is then given by

$$t_{AB} = \theta_1 \tau \qquad [34]$$

where $\tau$ is the ratio of the scale factors on the x and y-axes, defined
as

$$\tau = \frac{\text{number of } x \text{ units per division}}{\text{number of } y \text{ units per division}}$$

In the case shown, $\theta_1$ = 26 deg = 0.454 radians, and $\tau$ = 1.0/0.5
= 2, so that

$$t_{AB} = (0.454)(2) = 0.908 \text{ sec}$$

In similar fashion, segments BD and DF are approximated by
arcs of circles having their centers at $C_2$ and $C_3$, angles $\theta_2$ and $\theta_3$
are measured, and the corresponding times calculated.
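
The arithmetic of Equation [34] is easily mechanized. The sketch below is illustrative; the function name `arc_time` and its argument names are assumptions, and only the numbers quoted in the text (26 deg, 1.0 and 0.5 units per division) are taken from the paper.

```python
import math


def arc_time(theta_deg, x_units_per_division, y_units_per_division):
    """Time spent on one circular arc: subtended angle (radians) times scale ratio."""
    tau = x_units_per_division / y_units_per_division
    return math.radians(theta_deg) * tau


# Values quoted in the text for arc AB.
t_ab = arc_time(26.0, 1.0, 0.5)
```

The total time along a trajectory would be the sum of `arc_time` over the successive arcs, each with its own measured angle.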

Where detailed knowledge of the trajectories is not needed and 
a qualitative idea of system behavior is sufficient, the labor re- 
quired for construction of trajectories and calculation of time in- 
tervals can be avoided by using the notion of singular points. 


Singular Points 


As may easily be imagined, difficulty in constructing system
trajectories is experienced if dy/dx takes the form 0/0, since the
slope of the trajectory is then indeterminate. Such points are
called "singular points" and are treated at length in texts on
nonlinear mechanics (1 to 3).


Fig. 17 Unstable Singular Points: (A) Saddle Point; (B) Spiral or Focal Point; (C) Nodal Point


For our purposes, we will assume that Equation [27] has been
put into the form

$$\frac{dy}{dx} = \frac{ax + by + F(x, y)}{y} \qquad [36]$$

which reduces to

$$\frac{dy}{dx} = \frac{ax + by}{y} \qquad [37]$$

by expanding the numerator in a power series and then dropping
the higher powers of x and y represented by F(x, y). The singular
points of this equation all occur at the origin. The nature of the
singular points and the nearby trajectories depends on the
coefficients a and b:

(1) $b^2 + 4a < 0$. The singular point is a center or vortex point
if b = 0 and is surrounded by elliptical trajectories, none of which
ever reaches the origin, as shown in Fig. 1(C). If b ≠ 0, the
singular point is a spiral or focal point, surrounded by spiral
trajectories which all reach the origin, as shown in Fig. 1(A).

(2) $b^2 + 4a = 0$. The singular point is a nodal point or node.
All trajectories reach the origin tangent to a single straight line,
as shown in Fig. 1(B).

(3) $b^2 + 4a > 0$. The singular point is a nodal point or node
if a < 0. If a > 0, the singular point is a saddle point, shown in
Fig. 17(A). Two trajectories pass through a saddle point; the
remainder approach the saddle point and turn away.

Spiral and nodal points are stable (trajectories approach the 
singular point) if b < 0 and unstable (trajectories leave the singu- 
lar point) if b > 0. Unstable spiral and nodal points are shown in 
Figs. 17(B) and 17(C). 
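
The classification rules above can be collected into a short routine. This Python sketch is an illustration, not part of the paper; the function name and the returned labels are assumptions, and the case a = 0, which the text does not treat, is reported as not covered.

```python
def classify_singular_point(a, b):
    """Classify the singular point at the origin of dy/dx = (a*x + b*y) / y."""
    if a == 0:
        return "not covered (a = 0)"
    if a > 0:
        # b*b + 4*a is then always positive: case (3) with a > 0.
        return "saddle point"
    disc = b * b + 4.0 * a
    if b == 0:
        # a < 0 and b = 0 give disc < 0: case (1) with b = 0.
        return "center"
    kind = "spiral point" if disc < 0 else "nodal point"
    stability = "stable" if b < 0 else "unstable"
    return stability + " " + kind


examples = {
    (-1.0, 0.0): classify_singular_point(-1.0, 0.0),
    (-1.0, -1.0): classify_singular_point(-1.0, -1.0),
    (-1.0, 1.0): classify_singular_point(-1.0, 1.0),
    (-1.0, -2.0): classify_singular_point(-1.0, -2.0),
    (-1.0, -3.0): classify_singular_point(-1.0, -3.0),
    (1.0, 0.0): classify_singular_point(1.0, 0.0),
}
```

The test $b^2 + 4a$ is the discriminant of the characteristic equation $s^2 - bs - a = 0$ of the linearized system, which is why its sign separates oscillatory from nonoscillatory behavior.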

The saddle point is a point of unstable equilibrium and is
encountered in control systems in which the torque-error relation is

$$T = T_m \sin e$$

as might occur with certain types of error-sensing devices. In
this case, the singular points do not occur only at the origin. A
phase-plane plot of typical behavior for such a system is given in
Fig. 18.


Fig. 18 Phase-Plane Representation of System With Focal Points and Saddle Points


Systems With Time-Varying Inputs 


Ramp Inputs. In the examples considered so far, the input has
been a step function. With the system at rest, the initial
conditions are $e(0) = e_0$ and $\dot{e}(0) = 0$, a point on the e or x-axis of the
phase plane. The graphical constructions which have been
described are applicable, with appropriate modifications, to some
systems subjected to ramp inputs.

As an example, we will consider a system described by the
equation

$$\ddot{c} + \mu(\dot{c}) + c = r \qquad [39]$$

$$r = R_0 + Rt \qquad [40]$$

where c is the output or controlled variable, r is the input or
reference, and $\mu(\dot{c})$ denotes a friction torque which is a nonlinear
function of the output rate $\dot{c}$. As indicated in Fig. 19, the total
friction torque includes viscous, coulomb, and friction components.

Fig. 19 Friction Torque Versus Output Velocity

Fig. 20 Phase-Plane Construction for System With Ramp Input (R = 1.0)

In order to use Lienard's construction, the equation for the
system is rewritten using the basic definitions

$$e = r - c = R_0 + Rt - c \qquad [41]$$

$$\dot{e} = \dot{r} - \dot{c} = R - \dot{c} \qquad [42]$$

$$\ddot{e} = -\ddot{c} \qquad [43]$$

giving

$$\ddot{e} - \mu(R - \dot{e}) + e = 0 \qquad [44]$$

Substituting x = e and y = $\dot{e}$, Equation [44] can be written

$$\frac{dy}{dx} = \frac{\mu(R - y) - x}{y} \qquad [45]$$

or

$$\frac{dy}{dx} = \frac{-\phi(y) - x}{y} \qquad [46]$$

where

$$\phi(y) = -\mu(R - y) \qquad [47]$$

Fig. 21 Phase-Plane Construction for System With Ramp Input (R = 0.25)


Equation [47] shows that the curve $x = -\phi(y)$ used in the Lienard
construction is simply the friction curve $\mu(\dot{c})$ rotated 90 deg
and then translated vertically along the y-axis to account for R.

For R = 1, the curve is translated upward one unit, as shown
in Fig. 20. With an initial input value $R_0$ = 3 the trajectory starts
(at A) with e(0) = 3, $\dot{e}(0)$ = 1, and ends (at B) where $e_{ss}$ = 1.5
and $\dot{e}$ = 0. Point B may be called a stable focal point.

For 0 < R < 0.5, say R = 0.25, the steady-state error in this system
is not constant but periodic, as shown in Fig. 21. With an initial
input value $R_0$ = 3, the trajectory starts at A with e(0) = 3,
$\dot{e}(0)$ = 0.25, and goes to B. According to Equation [46], points
for which $x = -\phi(y)$ and y ≠ 0 are points of zero slope, so the
trajectory follows the curve from B to D. The trajectory leaves the
curve at D, and returns to the curve at F where a periodic motion
begins. The singular point at G is an unstable focal point.

General Inputs. If this system were subjected to a general
time-varying input, say r = r(t), Equation [39] would become

$$\ddot{c} + \mu(\dot{c}) + c = r(t) \qquad [48]$$

Letting x = c and y = $\dot{c}$, we would obtain the equation

$$\frac{dy}{dx} = \frac{r(t) - \mu(y) - x}{y} \qquad [49]$$

which is similar to Equation [27] except for the numerator term
r(t). The presence of this term destroys the unique dependence of
dy/dx on the phase-plane co-ordinates x and y. Curves can still
be drawn showing the history of the system, using extensions of
the methods already described, but the significance of the curves
is no longer the same. Procedures for cases of this type are given
by Ku (14) and Buland (28).

Higher-Order Systems. Faced with the third-order differential
equation

$$\dddot{e} + a_2\ddot{e} + a_1\dot{e} + a_0 e = 0 \qquad [50]$$

we could make the substitutions

$$x = e, \qquad y = \dot{e}, \qquad z = \ddot{e}$$

and examine system behavior in a three-dimensional phase space
having co-ordinates x, y, and z. Because of difficulties in plotting
curves in three dimensions, we would probably prefer to project
the system trajectories into two planes, where the differential
equations would become

$$\frac{dy}{dx} = \frac{z}{y}$$

$$\frac{dz}{dx} = \frac{-a_2 z - a_1 y - a_0 x}{y} \qquad [54]$$

or

$$\frac{dz}{dy} = \frac{-a_2 z - a_1 y - a_0 x}{z}$$

Methods for constructing trajectories for third and higher-order 
systems have been presented by Ku (14), and some of the topo- 
logical aspects of third-order systems (singular points and periodic 
solutions) are considered by Rauch (29). Phase-space considera- 
tions also have been used in the design of optimum relay servo- 
mechanisms (12). 

Some higher-order systems are adequately approximated by 
second-order differential equations; implications of this fact for 
phase-plane analysis are examined in some detail by Kalman (36). 


Appendix 2 


Describing-Function Method


In the describing-function method as outlined in the body of 
this paper, it is assumed that: 


1 The input to the nonlinear element is sinusoidal. 

2 Only one nonlinear element occurs in the system. 

3 The characteristics of the nonlinear element are independent 
of frequency. 


Inability to satisfy these assumptions in a particular problem 
introduces difficulties which we propose to discuss briefly. 

With a sinusoidal input, the output of the nonlinear element 
will necessarily contain harmonic components as well as the funda- 
mental component used to define the describing function. Ordi- 
narily, these harmonic components are smaller than the funda- 
mental component and are, in addition, reduced in magnitude by 
the filtering action of dynamic elements in the system. Where 
the harmonic components are not negligible at the input to the 
nonlinear element, a more complete analysis along lines suggested 
by Johnson is required (31). 

If the nonlinear element does not have a symmetrical input-output
characteristic, the output will also contain a constant or
d-c component. This component is not filtered in the usual control
system and may, in fact, tend to be greatly amplified by
integrators in the system. The magnitude of the d-c component 
at the output of a nonlinear element depends on the magnitude of 
both the d-c and sinusoidal components of the input, as does the 
magnitude of the fundamental component of the output. Equi- 
librium conditions in such systems are determined by considera- 
tion of two interrelated criteria similar to Equation [8]; details 
are available elsewhere (32). 

Some nonlinear control systems may exhibit a peculiar type of
response in which a sinusoidal input at frequency $n\omega$ produces an
output at frequency $\omega$. In a study of this phenomenon, West and
Douce (33) have introduced a describing function which expresses
the fundamental component of the nonlinear element output
(having the frequency $\omega$) as a function of two sinusoidal input
amplitudes, one with frequency $\omega$ and another with frequency
$n\omega$. Only one equilibrium condition, similar to Equation [8],
was important in their study, that relating to components at
frequency $\omega$.

The describing-function method is generally applied to systems 
having only one nonlinear element. If several nonlinearities 
occur simultaneously, it may be possible to represent them by a 
single describing function, as in the case of dead zone and satura- 
tion shown in Fig. 12. If two nonlinearities are separated by 
energy-storage elements (or if one nonlinearity is inextricably 
mixed with energy-storage elements), the combination must be 
represented by a describing function which is both amplitude- and 
frequency-dependent. The single curve —1/Gp in the polar plot 
must be replaced by a family of curves for a number of frequen- 
cies, and the condition for a sustained oscillation becomes 

Wy) 
where w,, denotes the frequency of oscillation. 

A quasi-linearization method, similar in concept to the de- 
scribing-function method, has been proposed by Booton for the 
study of systems subjected to random inputs (34). In his method, 
the nonlinear element is characterized by its response to a random 
input, and the result of the analysis is an estimate of the system 
r-m-s error. 


Discussion 


R. W. Bass.⁴ This paper gives a clear summary of the
methods now employed in nonlinear control-system analysis and 
synthesis. 

In the writer’s opinion, the describing-function method and (to 
a lesser extent) the phase-plane method are destined to be super- 
seded by techniques better adapted to the nature of the problem. 

The phase-plane, used in any of the forms mentioned by the
author (Appendix 1), is a very powerful tool. An even more rapid
method for obtaining a picture of the transient and steady-state
response to steps and ramps has recently been introduced by
R. E. Kalman.⁵

For systems of order n > 2, Kalman's techniques can be applied
in the corresponding n-dimensional phase space. Such
investigations⁶,⁷ undoubtedly will continue to prove fruitful. However, they
seem better adapted to analyses of a specific system (or class of
systems) than to provide new synthesis insights (as they did for
n = 2, 3).

The describing-function method is highly overrated as a synthesis
tool. Although it can be rendered mathematically
legitimate,⁸ it provides at most a negative criterion (even when it
works). It does often indicate correctly whether or not the system
is (statically) stable, but of course it has nothing to offer
concerning optimization. Worse still, it can lead to the appearance
of auxiliary nonlinear phenomena which are undesirable but
which the method does not reveal.⁸

For example, some relay servos cannot be stabilized by a com- 
pensating network unless so much error-rate feedback is em- 
ployed that the servo “chatters’’ in response to a step input. 


4 Department of Mathematics, Princeton University, Princeton, N. J.

5 "Physical and Mathematical Mechanisms of Instability in Nonlinear
Automatic Control Systems," by R. E. Kalman, published in
this issue, pp. 547-552.

6 See Bibliography (36).

7 "Analysis and Design Considerations of Second and Higher Order
Saturating Servomechanisms," by R. E. Kalman, Trans. AIEE, vol.
74, part 2, Applications and Industry, 1955, no. 2, pp. 294-309.

8 "Equivalent Linearization, Nonlinear Circuit Synthesis, and the
Stabilization and Optimization of Control Systems," by R. W. Bass,
to be published in Proceedings of the Symposium on Nonlinear
Circuit Analysis, MRI Symposia Series, vol. 6, October, 1956.


This not only may degrade the response time, but (as the author 
has informed the writer) will cause the relay to wear out more 
quickly. 

Proofs of these statements, and a discussion of some new
alternative methods, are to be found elsewhere.⁸


Herbert Saunders.⁹ The author has presented a most interesting
and timely paper on this subject. As stated by the
author, there exist numerous methods of solving nonlinear dif- 
ferential equations but none is completely satisfactory. For engi- 
neering purposes one needs some sort of solution even if it is only 
approximate. On many occasions a reduction or a great simplifi- 
cation will reduce the complexity of the differential equations em- 
ployed in describing the problem so that the analysis may be simple 
as well as economical in respect to final results. Since most of our 
prevalent systems are of second order, many solutions are based on 
the linear second-order differential equation in which we may 
consider the higher-order equations as insignificant. If a signi- 
ficant nonlinearity which definitely cannot be ignored exists, the 
analytic solution becomes extremely difficult and arduous. The 
phase-plane method becomes very important when the equations 
cannot be integrated in closed form. Other writers¹⁰⁻¹³ similarly
have gone into great detail on the advantages and disadvantages 
of the phase plane; their work is not repeated here. The phase- 
plane delta method has been touched on slightly, yet is one of the 
most powerful phase-plane methods available. Actually, the 
Lienard method as discussed in the Appendix of the paper is but 
a special case of this method. Consider a general differential 
equation 


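$$J\ddot{e} + H(e, \dot{e}, t) = 0$$

where $H(e, \dot{e}, t)$ is a nonlinear function including time as well as
describing any imposed external disturbance to which the system's
response is required. Following Jacobsen,¹¹ the system can be
put in a simpler form. Let

$$H(e, \dot{e}, t) = f\,\delta(e, \dot{e}, t) + fe$$

and if

$$\omega^2 = f/J$$

then

$$\ddot{e} + \omega^2(e + \delta) = 0 \qquad [58]$$

where

$$\delta = \delta(e, \dot{e}, t)$$

If $\dot{e}/\omega = y$ then

$$\ddot{e} = \omega^2 y \frac{dy}{de}$$
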

9 Missiles and Ordnance Systems Department, General Electric
Company, Philadelphia, Pa. Assoc. Mem. ASME.

10 "Phase Plane Analysis of Nonlinear Control Systems," by I. R.
Dalton, Research Report No. 3, University of Toronto, November,
1954.

11 "On a General Method of Solving Second-Order Ordinary Differential
Equations by Phase Plane Displacements," by L. S. Jacobsen,
Journal of Applied Mechanics, Trans. ASME, vol. 74, 1952, pp. 543-553.

12 "Geometrical Methods in the Analysis of Ordinary Differential
Equations," by J. Kestin and S. K. Zaremba, Applied Scientific
Research, section B, vol. 3, 1953, pp. 149-189.

13 "On Motions of an Oscillatory System Under the Influence of
Flip-Flop Controls," by I. Flügge-Lotz and K. Klotter, NACA TM
1237, November, 1949.

Then Equation [58] simplifies to the following

$$\frac{dy}{de} = -\frac{e + \delta}{y}$$


Both Jacobsen¹¹ and Bishop¹⁴ demonstrate the actual geometrical
construction of this method. For further details the
foregoing papers should be consulted. Considering that the equation

$$\ddot{e} + g(\dot{e}) + \omega^2 e = 0$$

is but a special case of the phase-plane delta method, the graphical
procedure is faster and more accurate than the Lienard method.
This method can be applied to further problems concerned with
systems subjected to transient loadings and multidegree-of-freedom
systems.¹⁵,¹⁶


Author's Closure


As workers in the field are well aware, there is an extensive 
literature on the subjects of nonlinear servomechanisms and 
mechanics. In writing this introductory paper, no attempt was 
made to survey the entire literature. The references provided by 
the discussers are a welcome addition to the paper. By tracking 
down the references listed in the references, and so on ad infinitum, 
the reader can find still additional information and can easily 
compile an extensive bibliography. 

The author shares the feeling that existing methods leave
something to be desired. The present methods are probably
adequate for second-order systems whose nonlinearities can be 
concentrated in a single box and which are subjected to specialized 
inputs such as steps, ramps, or sinusoids. The more complicated 
problems presented by higher-order systems, containing several 
isolated nonlinearities and subjected to realistic inputs, are be- 
ginning to receive the attention they deserve. Research workers 
in the field can profitably focus their attention on these problems. 

It may be well to add that analog or digital computers can, 
in principle, be used to study these complex systems. Computers 
can be regarded either as system models or mathematical ma- 
chines, and their speed and flexibility permit systematic examina- 
tion of a large number of alternative designs. Paper-and-pencil 
methods are, nevertheless, important. They provide insights into 
system behavior which cannot be obtained by looking at a pile 
of computer records. Experience with paper-and-pencil methods 
helps the designer to interpret computer results, diagnose troubles 
when they occur, and decide on a rational system change for the
next test. 

The goal, methods of analysis which answer all of the designer’s 
questions with no effort on his part, is unobtainable. Any steps 
toward the goal will be a useful contribution to the field. 


14 "On the Graphical Solution of Transient Vibration Problems,"
by R. E. D. Bishop, Proceedings of The Institution of Mechanical
Engineers, vol. 168, 1954, pp. 299-322.

15 "Response of an Elastically Non-Linear System to Transient
Disturbances," by R. L. Evaldson, R. S. Ayre, and L. S. Jacobsen,
Journal of The Franklin Institute, vol. 248, December, 1949, pp. 473-494.

16 "Transient Vibrations of Linear Multidegree-of-Freedom Systems
by Phase-Plane Method," by R. S. Ayre, Journal of The Franklin
Institute, vol. 253, no. 2, 1952, p. 153.


or 
4 dy e+6 
dey 


How to Obtain Describing Functions for 
Nonlinear Feedback Systems 


By KARL KLOTTER,¹ STANFORD, CALIF.


A method is presented by which Describing Functions for 
nonlinear elements, whose behavior is defined by nonlinear 
differential equations (as contrasted with nonlinear rela- 
tionships between the input and output variables them- 
selves), can be obtained immediately (without first solving 
the differential equations). The method is applied to 
feedback systems containing one linear and one nonlinear
element, each described by a second-order differential equation.


STATING THE PROBLEM 

In linear-system analysis the concept of frequency response,
in the form of (direct or inverse) transfer functions, is well
known; and it has been found to be an extremely useful tool.
As a matter of fact, it is of such usefulness that the temptation to
extend this basically linear concept into the domain of nonlinear
systems has proved irresistible. If we discount some suggestions
in earlier sources (1),² R. Kochenburger (2, 3) is to be credited
with having developed a nonlinear counterpart to the transfer
function that became known as the "Describing Function."

To emphasize the analogy to the respective linear concept, in
this paper we frequently will replace the term Describing
Function by "equivalent transfer function" with qualifying
adjectives, such as "direct," "inverse," "complex," "real," and so on,
as the case may be. [Perhaps a more appropriate (although still
longer) term would be "equivalent frequency-response function"
with the pertinent qualifying adjectives.]

Whatever name may be used for that function, the concept 
embodies the following idea: 

In a nonlinear element a sinusoidal input will produce a non- 
sinusoidal, although (for steady state) periodic, output. In order 
to perpetuate the use of a (complex) amplitude ratio one may 
work with the first harmonic of the periodic output. 

This suggestion sounds reasonable enough and good argu- 
ments can be offered to support it (4). Such arguments are es- 
sentially based on the filtering effect which takes place in the linear 
part of a transmitting system, in particular of a closed-loop sys- 
tem. On the other hand, because the higher harmonics, which 
are definitely present in the output, are disregarded, the results 
of using the describing function have to be watched attentively, 
even with some suspicion, and they always stand in need of care- 
ful checking and circumspect interpretation. In fact, the whole 
procedure warrants some more basic justification or at least some 
exploring of its inherent limitations. In this paper, however, we 
will not concern ourselves with those questions of justification or 
limitation. We will accept, more or less naively, the suggestion 
as proposed and the concept as customarily used and applied. 


1 Professor of Engineering Mechanics, Stanford University. Mem.
ASME.

2 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Instruments and Regulators Division of The
American Society of Mechanical Engineers and presented at
the ASME-AIEE Conference on Nonlinear Control Systems,
Princeton, N. J., March 26-28, 1956.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
17, 1956. Paper No. 56-IRD-5.


In looking over the applications of the Describing-Function concept
which have been made to date, one cannot fail to note that
all cases which have been treated, such as dead band (7, 8), saturation
(7), limiting (3), linkages (5), contactors (2), and coulomb friction
(6), are cases where the relationship between input and output
can be given by a nonlinear expression for the variables themselves,
as contrasted with a differential equation. The treatment
of the Describing Function in () is further testimony to that fact.

Where, very occasionally, an element has been considered whose 
relationship between input and output is expressed by a differen- 
tial equation, the only suggestion offered has been to solve the 
differential equation by some computing technique (numerical or 
instrumental) and then to determine the first Fourier component 
of that computed output. Such a procedure is extremely tedious 
and clumsy, however. 

Up to now, no practical way has been known for obtaining
the Describing Function for elements possessing differential
relationships and hence for feedback systems containing such
elements. It is the purpose of this paper to present a convenient
method for producing the desired information, namely the first
harmonic of the output function of an element whose behavior is
described by a nonlinear differential equation.

In earlier papers (10, 11) the author has explained the funda- 
mentals of such a method. Therefore here we will dispense with 
explaining the philosophy underlying it; we will rather focus our 
efforts on showing how the method can be used for producing the 
Describing Function of a nonlinear element without actually solv- 
ing its differential equation. It will become clear, the author 
hopes, that the method is well suited for the purpose at hand; in 
fact, it seems to be custom-made for the present needs. And, cer- 
tainly, its potentialities have not yet been exhausted. 


A METHOD FOR PRODUCING THE DESCRIBING FUNCTION (EQUIVALENT
TRANSFER FUNCTION) OF A NONLINEAR ELEMENT


Let

$$E(z) \equiv M(z) - \kappa^2 y(t) = 0 \qquad [1]$$

denote the nonlinear differential equation of the element under
consideration with $y(t) = Y \cos \omega t$ standing for the input and z
for the unknown output. By way of illustration we will treat the
differential equation

$$E \equiv \ddot{z} + 2D\kappa\, g(\dot{z}) + \kappa^2 f(z) - \kappa^2 Y \cos \omega t = 0 \qquad [2]$$

which happens to be of second order. It contains two nonlinear
terms $g(\dot{z})$ and f(z). For simplicity of presentation we will assume
here that these two functions $g(\dot{z})$ and f(z) are odd functions
of their respective arguments. [In paper (11) the treatment is
outlined also for the case of nonodd functions.] And as an example
of Equation [2] we are going to treat the system described by

$$g(\dot{z}) = \dot{z}, \qquad f(z) = z(1 + \mu^2 z^2)$$

hence

$$\ddot{z} + 2D\kappa\dot{z} + \kappa^2 z(1 + \mu^2 z^2) - \kappa^2 Y \cos \omega t = 0 \qquad [3]$$

Equation [3] is known as "Duffing's differential equation."

According to what was said earlier we wish to produce an
approximation

$$\tilde{z}(t) = Z \cos(\omega t - \epsilon) \qquad [4]$$

to the output function z(t), with the parameters Z and $\epsilon$
determined in such a way that they represent "best" values in some
sense or other.

Because the function $\tilde{z}(t)$, Equation [4], cannot satisfy the
differential Equation [2] at every instant, we are going to satisfy it
in some "weighted average." Appropriate weight functions can
be shown (10, 11) to be $\cos \omega t$ and $\sin \omega t$. The method (known as
the Ritz averaging method or Ritz-Galerkin method) then asks
for

$$\int_0^{2\pi} E[\tilde{z}(\sigma)] \cos \sigma \, d\sigma = 0, \qquad \int_0^{2\pi} E[\tilde{z}(\sigma)] \sin \sigma \, d\sigma = 0 \qquad [5]$$

with $\sigma = \omega t$. These are two equations for determining the two
parameters Z and $\epsilon$ in the Approximating Function [4].

Without giving here any justification of the procedure we just 
list the steps which are to be taken: 

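Step 1. From the two functions f(z) and $g(\dot{z})$, which appear in
the differential Equation [2] of the element under consideration
we derive two new functions F(Z) and G(Z, $\omega$) by performing the
following integrations

$$F(Z) = \frac{4}{\pi}\,\frac{1}{Z} \int_0^{\pi/2} f(Z \cos \sigma) \cos \sigma \, d\sigma$$

or equivalently

$$F(Z) = \frac{4}{\pi}\,\frac{1}{Z} \int_0^{\pi/2} f(Z \sin \sigma) \sin \sigma \, d\sigma \qquad [6a]$$

and

$$G(Z, \omega) = \frac{4}{\pi}\,\frac{1}{\kappa}\,\frac{1}{Z} \int_0^{\pi/2} g(Z\omega \sin \sigma) \sin \sigma \, d\sigma$$

or equivalently

$$G(Z, \omega) = \frac{4}{\pi}\,\frac{1}{\kappa}\,\frac{1}{Z} \int_0^{\pi/2} g(Z\omega \cos \sigma) \cos \sigma \, d\sigma \qquad [6b]$$

By way of explanation we may add that Equations [6] amount
to the following procedure: Introduce $Z \cos \omega t$ and $Z\omega \sin \omega t$ for
z and $\dot{z}$, respectively, replace the functions f and g of these
arguments by functions of multiple arguments ($2\omega t$, $3\omega t$, etc.), and
retain the fundamental terms only.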
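
Step 1 can also be carried out numerically, which gives a check on the closed-form results for particular functions f and g. The Python sketch below is illustrative only. The function name `first_harmonic_functions`, the midpoint-rule integration, and the parameter values are assumptions; in particular, the factor 1/κ in G is an assumed normalization, chosen so that linear damping g = ż gives G = ω/κ.

```python
import math


def first_harmonic_functions(f, g, Z, omega, kappa, n=2000):
    """Evaluate F(Z) and G(Z, omega) by midpoint integration over a quarter period."""
    h = (math.pi / 2.0) / n
    sum_f = 0.0
    sum_g = 0.0
    for i in range(n):
        s = (i + 0.5) * h
        sum_f += f(Z * math.cos(s)) * math.cos(s)
        sum_g += g(Z * omega * math.sin(s)) * math.sin(s)
    F = (4.0 / math.pi) * sum_f * h / Z
    # Assumed normalization: division by kappa makes G dimensionless.
    G = (4.0 / math.pi) * sum_g * h / (kappa * Z)
    return F, G


# Duffing example with illustrative (assumed) parameter values.
mu, Z, omega, kappa = 0.5, 2.0, 3.0, 2.0
F, G = first_harmonic_functions(
    lambda z: z * (1.0 + mu ** 2 * z ** 2),  # f(z) = z(1 + mu^2 z^2)
    lambda v: v,                             # g(z') = z'
    Z, omega, kappa,
)
```

For these inputs the routine reproduces F = 1 + (3/4)(μZ)² = 1.75 and G = ω/κ = 1.5.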

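Step 2. Using those functions F and G and abbreviating $\omega/\kappa$ by
$\eta$ we obtain the amplitude Z and the phase shift $\epsilon$ from the two
equations (which follow from Equations [5])

$$[F - \eta^2]^2 + 4D^2G^2 = \left(\frac{Y}{Z}\right)^2 \qquad [7a]$$

and

$$\tan \epsilon = \frac{2DG}{F - \eta^2} \qquad [7b]$$

Passing on, now, to the special differential Equation [3] we find
F and G to be

$$F = 1 + \frac{3}{4}\bar{Z}^2, \qquad G = \eta \qquad [8]$$

where use is made of the abbreviation $\bar{Z} = \mu Z$. Hence the
Equations [7] read

$$\left[1 + \frac{3}{4}\bar{Z}^2 - \eta^2\right]^2 + 4D^2\eta^2 = \left(\frac{Y}{Z}\right)^2 \qquad [9a]$$

and

$$\tan \epsilon = \frac{2D\eta}{1 + \frac{3}{4}\bar{Z}^2 - \eta^2} \qquad [9b]$$

Equations [9] give both the modulus |N| = Y/Z and the
argument $\epsilon$ of the "inverse equivalent transfer function"

$$N = |N| e^{i\epsilon} \qquad [10]$$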


This (complex) transfer function can be plotted as a family of
curves, either with $\bar{Z}$ as a parameter of the family and $\eta$ varying
along the individual curves, or with $\eta$ as a parameter of the family
and $\bar{Z}$ varying along the individual curves. In the next section,
Fig. 2, we will see the plot for N used for dealing with feedback
systems containing the element which is under consideration
here. The solid curves of one set (parabolas) in Fig. 2 represent
the loci of N for values of $\bar{Z}$ = 0; 1; 2; 3, respectively; the other
set (straight lines) represents the loci of N for the parameter
values $\eta$ = 0; 1; 2; 3, respectively.

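One realizes readily that Equations [7] for the general case of
differential Equation [2] or Equations [9] for the special
case of differential Equation [3] are equivalent to

$$|N| \cos \epsilon = F - \eta^2, \qquad |N| \sin \epsilon = 2DG$$

with

$$|N| = \frac{Y}{Z}$$

or

$$|N| \cos \epsilon = 1 + \frac{3}{4}\bar{Z}^2 - \eta^2, \qquad |N| \sin \epsilon = 2D\eta$$

with

$$|N| = \frac{Y}{Z}$$

respectively.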
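
Since the real and imaginary parts of N are given separately for the Duffing element, the function is conveniently handled as a complex number. The following Python sketch is an illustration; the function name `duffing_inverse_tf` and the sample values of the normalized amplitude, frequency ratio, and damping are assumptions.

```python
import cmath


def duffing_inverse_tf(z_bar, eta, damping):
    """Inverse equivalent transfer function of the Duffing element.

    z_bar is the normalized amplitude mu*Z, eta the frequency ratio omega/kappa,
    and damping the coefficient D.
    """
    real_part = 1.0 + 0.75 * z_bar ** 2 - eta ** 2
    imag_part = 2.0 * damping * eta
    return complex(real_part, imag_part)


# Illustrative (assumed) values.
N = duffing_inverse_tf(2.0, 1.5, 0.5)
modulus = abs(N)          # equals Y/Z
phase = cmath.phase(N)    # equals the phase shift epsilon, in radians
```

Holding `z_bar` fixed and sweeping `eta` traces one of the parabolas of the plot; holding `eta` fixed and sweeping `z_bar` traces one of the straight lines.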


APPLICATION OF THE EQUIVALENT TRANSFER FUNCTION TO
FEEDBACK SYSTEMS


In parts (a) through (d) of Fig. 1 four cases of feedback systems
are listed, each containing two elements. The first element,
I, is supposed to be linear in all cases, the second one, II, either
linear or nonlinear. The (complex) inverse transfer function of
the linear element I is denoted by $L_I$; the (complex) inverse
transfer function of element II, if linear, by $L_{II}$, if nonlinear by
$N_{II}$ (then to be called "equivalent" transfer function).

First, consider all elements to be linear. Then the over-all
(complex) direct transfer functions are (with asterisks denoting
complex amplitudes):


In cases (a) and (b)

[11a,b]

In case (c)

In case (d)

Self-sustained oscillations in all of these systems will occur when
the denominator vanishes

$$1 + L_I L_{II} = 0 \qquad [12]$$

Equation [12] is the basis for the use of the Nyquist criterion.

Next, consider element II to be nonlinear in all cases. The
only change we have to make is to replace the inverse transfer
function $L_{II}$ by its nonlinear counterpart, the inverse equivalent
transfer function $N_{II}$, as developed in the preceding section.
Hence self-sustained oscillations may be expected if there are
roots to the equation

$$1 + L_I N_{II} = 0 \qquad [13a]$$

or

$$N_{II} = -1/L_I \qquad [13b]$$

By way of example we consider such loops where the behavior
of both elements is described by a second-order differential
equation. The differential equation for the linear element I will be
specified as

$$\ddot{y} + \delta_1\dot{y} = \ldots \quad \text{or} \quad \ddot{z} + \delta_1\dot{z} = \ldots \qquad [14]$$

the differential equation for element II, if nonlinear, will be
assumed identical with Equation [3] (Duffing's equation), specifically

$$\ddot{z} + \delta_2\dot{z} + \kappa_2^2 z(1 + \mu^2 z^2) = \ldots \quad \text{or} \quad \ddot{y} + \delta_2\dot{y} + \kappa_2^2 y(1 + \mu^2 y^2) = \ldots \qquad [15]$$

If both elements were linear ($\mu$ = 0), their (complex) inverse
transfer functions would be

$$L_I = \frac{1}{K_1^2}(-\eta^2 + 2D_1\eta i) \qquad [16a]$$

hence

$$-\frac{1}{L_I} = K_1^2\,\frac{\eta^2 + 2D_1\eta i}{\eta^4 + 4D_1^2\eta^2} \qquad [16b]$$

and

$$L_{II} = (1 - \eta^2) + 2D_2\eta i$$

where

$$K_1 = \kappa_1/\kappa_2, \qquad \eta = \omega/\kappa_2, \qquad 2D_1 = \delta_1/\kappa_2, \qquad 2D_2 = \delta_2/\kappa_2$$

In the nonlinear case $L_{II}$ is to be replaced by $N_{II}$

$$N_{II} = \left(1 + \frac{3}{4}\bar{Z}^2 - \eta^2\right) + 2D_2\eta i \qquad [19]$$


Fig. 1 Feedback Systems

In Fig. 2 plots are shown of $N_{II}$ (solid lines), Equation [19],
and $-1/L_I$ (long dashes), Equation [16b], with the following
numerical values used

$$\kappa_1^2 = 5000/\text{sec}^2, \qquad \kappa_2^2 = 100/\text{sec}^2, \qquad \delta_1 = 10/\text{sec}, \qquad \delta_2 = 10/\text{sec} \qquad [20]$$

As is indicated on the plot, there does exist an intersection of the
curve $-1/L_I(\eta)$ with the family of curves $N_{II}(\bar{Z}, \eta)$ for a common
parameter value $\eta$. For the example shown this value happens
to be $\eta$ = 2.56. Hence (stable) self-sustained oscillations
will occur. They have an (angular) frequency $\omega$ = 25.6/sec and
an amplitude Z determined by $\mu Z$ = 4.03 ($\mu$ has not been specified
here).

Fig. 2 Plot of Functions $-1/L_I(\eta)$ (long dashes), Equation [16b], and $N_{II}(\bar{Z}, \eta)$ (solid lines), Equation [19]

Just in passing it should be mentioned that if element II is
linear ($\mu$ = 0) the closed-loop systems may or may not be stable,
depending on the values of the parameters. For the numerical
values specified, the linear systems are unstable. This fact can be
verified easily either from the Nyquist criterion or from the
Routh-Hurwitz criterion.

The differential equation of the closed-loop linear systems 
reads 


+ (6, + + (id, + 


+ + = . [21] 
Therefore, the Hurwitz determinants become in turn 
H, = 6, + & 
Hz = 5,%2 + 5:52? + | 


.. [22] 
Hz = + 5152 + — 2x?) — x12(6,2 + J 

Whereas $H_1$ and $H_2$ are always positive (for positive damping),
$H_3$ may or may not be positive. For the numerical values specified,
$H_3$ is negative; hence the linear systems are unstable.

We have, therefore, before us systems which, if linear, prove to
be unstable (showing indefinitely increasing amplitudes) but
which can be restrained to a limit cycle by the addition of a
nonlinear term, $\mu^2 z^3$ in our case.

Details of the various dependencies (of amplitudes and phase
angles on frequency $\omega$, on damping coefficients $\delta_1$ and
$\delta_2$, on nonlinearity $\mu^2$, and so on) can be discussed; the
results may be presented in a separate paper. Replacing the graphical
procedure used here by an analytical one will help to improve the
accuracy of the resulting numerical values.


ACKNOWLEDGMENTS 


This paper presents some partial results obtained in the course 
of studies aided by the National Science Foundation. The author 
wishes to acknowledge the assistance of Mr. Hugh L. Smith, 
graduate student in Engineering Mechanics at Stanford Uni- 
versity. 


BIBLIOGRAPHY 


1 "Dynamik selbsttätiger Regelungen," by R. C. Oldenbourg
and H. Sartorius, München, Germany, 1944, translated by H. I.
Mason, "Dynamics of Automatic Control," New York, N. Y., 1948.

For further references to earlier papers see the discussion remarks
by G. A. Philbrick to paper (2), and the remarks in (9), p. 556,
concerning papers by L. C. Goldfarb, W. Oppelt, and A. Tustin.

2 "A Frequency Response Method for Analyzing and Synthesizing
Contactor Servomechanisms," by R. J. Kochenburger, AIEE
Trans., vol. 69, part 1, 1950, pp. 270-284.

3 "Limiting in Feedback Control Systems," by R. J. Kochenburger,
AIEE Trans., vol. 72, part 2, July, 1953, pp. 180-194.

4 "Recent Advances in Nonlinear Servo Theory," by J. M.
Loeb, Trans. ASME, vol. 76, 1954, pp. 1281-1290 (more literature is
listed there).

5 "Sinusoidal Analysis of Feedback-Control Systems Containing
Nonlinear Elements," by E. C. Johnson, AIEE Trans., vol. 71, part 2,
July, 1952, pp. 169-181.

6 "Coulomb Friction in Feedback Control Systems," by V. B.
Haas, Jr., AIEE Trans., vol. 72, part 2, May, 1953, pp. 119-126.

7 "Approximate Frequency-Response Methods for Representing
Saturation and Dead Band," by H. Chestnut, Trans. ASME, vol. 76,
1954, pp. 1345-1364.

8 "Stability Characteristics of Closed-Loop Systems With Dead
Band," by C. H. Thomas, Trans. ASME, vol. 76, 1954, pp. 1365-1382.

9 "Automatic Feedback Control System Synthesis," by J. C.
Truxal, McGraw-Hill Book Company, Inc., New York, N. Y., 1955,
chapter 10.

10 "Nonlinear Vibration Problems Treated by the Averaging
Method of W. Ritz," by K. Klotter, Proceedings of the First National
Congress of Applied Mechanics 1951, ASME, New York, N. Y., 1952,
pp. 125-131.

11 "Steady State Vibrations in Systems Having Arbitrary Restoring
and Arbitrary Damping Forces," by K. Klotter, Proceedings,
Symposium Nonlinear Circuit Analysis, Polytechnic Institute of
Brooklyn, Brooklyn, N. Y., 1953, pp. 234-257.




Discussion 


R. W. Bass.³ This paper greatly enlarges the class of feedback
systems to which the describing-function method can be applied 
conveniently. The extension of the “equivalent transfer func- 
tion’’ from elements described by nonlinear functions to elements 
described by nonlinear differential equations is both a realistic 
and a welcome advance in the art. 

As the author says, “the whole procedure warrants (1) some 
more basic justification, or at least (2) some exploring of its in- 
herent limitations.” 

The procedure is essentially a heuristic criterion for the existence
of periodic solutions of nonlinear differential equations.
A rigorous criterion as suggested in (1) would involve the solution
of infinitely many implicit equations in infinitely many unknowns.
For that there are two techniques available: (a) the
Ritz-Galerkin variational methods, as developed, e.g., in
Lichtenstein's book on nonlinear integral equations and in the recent
work of Rothe; (b) the fixed-point techniques of Lefschetz
and Leray-Schauder.

In a recent contribution to the Symposium on "Nonlinear Circuit
Analysis,"⁴ the writer investigated problems (1) and (2).

By using technique (b) the infinite system can be reduced to a
single pair of simultaneous equations (which determine the frequency
and the fundamental amplitude). These equations are a
generalization of the classical bifurcation equations; if they can
be solved there will be a self-sustained oscillation. In some cases
they can be solved; this leads to an answer to question (1).

In general, however, it is too difficult to treat these equations 
directly. But on putting a certain term equal to zero, a pair of 
easily solved, approximate bifurcation equations is obtained. If 
one splits the author's Equations [12] or [13b] into real and
imaginary parts, one finds just these equations. In other words, 
the describing-function method is actually a graphical tech- 
nique for solving the approximate bifurcation equations. There 
are very general conditions under which the existence of this 
approximate solution necessitates the existence of an exact solu- 
tion. When these conditions are met, the describing-function 
method is legitimate; question (1) is answered. 

But it is often impractical to verify these conditions. What 
then of (2)? It can be shown that the higher the frequency found 
from Equations [12] and [13], the closer will be these equations to 
the true bifurcation equations. It seems plausible then that for 
high-frequency oscillations the describing-function method is 
more likely to be valid. For this and other reasons we suggest 
this as a tentative contribution to the author’s second question. 


AUTHOR'S CLOSURE


It is pleasing to learn that Dr. Bass is able to report some
decisive steps toward answering the two questions posed in the
paper and listed as (1) and (2) in the discussion. Dr. Bass'
paper⁴ will merit close attention.

The author perhaps is permitted to draw attention here to his
own paper in the symposium⁴ in which he discusses in more detail
the difference between the conventional concept of the describing
function and the "new describing function" as proposed
in the present paper. A distinction is made there between a
"Fourier Describing Function" and a "Hamilton Describing
Function"; the former being based on a first harmonic in the
"Fourier sense," the latter on a first harmonic in the "Hamilton
sense" minimizing the Hamiltonian integral.


3 Department of Mathematics, Princeton University, Princeton,
N. J.

4 Proceedings, Symposium on Nonlinear Circuit Analysis, Polytechnic
Institute of Brooklyn, Brooklyn, N. Y., vol. 6, October, 1956
(to be published).


Design and Analog-Computer Analysis 
of an Optimum Third-Order 


Nonlinear Servomechanism 


By H. G. DOLL¹ AND T. M. STOUT²


Great interest has been shown recently in deliberately 
nonlinear control systems, including a class of pro- 
grammed control systems. The design objective which 
distinguishes these systems is minimization of the re- 
sponse time for step inputs by proper automatic timing 
of relay operations. For a second-order relay servomecha- 
nism, the required switching relation is expressed by a 
curve in a two-dimensional phase plane which can be real- 
ized by a one-variable function generator in combination 
with linear elements. For a third-order system, the 
switching relation is expressed by a surface in a three-dimensional
phase space and requires a two-variable
function generator for its realization. The switching sur- 
face is computed in this paper for an idealized third-order 
positioning servomechanism having an output member 
characterized completely by its moment of inertia and a 
torque which varies linearly with time between two limits; 
small-signal nonlinearities such as backlash or relay 
threshold are neglected. An electro-optical two-variable 
function generator is employed in an analog-computer 
study of this system. For purposes of comparison, two 
alternative modes of control also are examined. The 
system using programmed control shows the expected 
superiority for step inputs, the advantage being greatest 
for small step magnitudes. For sinusoidal or random 
inputs, programmed control is superior when the input 
amplitude and/or frequency are low but exhibits some 
anomalous behavior for large amplitudes and/or fre- 
quencies, which result in inferior performance. Param- 
eter tolerances for programmed control systems are some- 
what less severe than anticipated. 


INTRODUCTION 


METHODS for the analysis and synthesis of linear feedback
control systems have had an intensive development
since 1940. At the same time, there has been a growing
interest in the analysis and design of nonlinear systems, directed
toward eliminating or minimizing harmful effects of nonlinearity 
and exploiting useful effects of nonlinearity. Work in the latter 
area has been concerned particularly with the design of systems 
subject to saturation, leading to the development of a class of de- 
liberately saturated or relay control systems to be described. 


1 Schlumberger Instrument Company, Ridgefield, Conn. 

2 Work done at Schlumberger Instrument Company, Ridgefield, 
Conn.; now with the Ramo-Wooldridge Corporation, Los Angeles, 
California. 

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters, January
6, 1956. Paper No. 56-IRD-10.


This development is motivated by the following line of reasoning. 

In many control systems the correcting action is made pro- 
portional to a linear combination of the system error, error rate, 
integrated error, and possibly other signals. This proportion- 
ality cannot, however, exist under all conditions, since a maxi- 
mum corrective action is always imposed by inherent limitations 
of the materials and energy sources employed. If these limits are 
to be reached only under extreme conditions, the system evi- 
dently must be designed to operate most of the time at less than 
maximum capacity. Accuracy and speed of response, it is felt, 
are therefore not as good as they might be. 

In the limit, as the proportionality constant is increased, the 
saturating linear system becomes effectively a relay system, in 
which a manipulated variable (such as field voltage) is switched 
between two fixed values. Stable operation can be achieved if 
the switching operation depends on a sufficient number of varia- 
bles. 

If the switching operation is made a nonlinear function of the 
system variables, a relay system can be designed which returns 
to equilibrium from any combination of initial values of the varia- 
bles in a minimum time. In particular, the system is optimum 
in the sense that its response time for a step input is a minimum; 
this is the usual basis of design of such systems. Relay systems 
designed to obtain this objective may be called “programmed 
control systems” since the controller automatically programs or 
times the switching operations. 

Work on a programmed controller for a third-order system was 
done by H. G. Doll in 1942, resulting in a patent application (1)³
in 1943; in this work the second-order system was considered a 
limiting special case of a third-order system. According to 
West (2), independent work on programmed control for a second- 
order positioning servomechanism is contained in an unpub- 
lished report written by F. C. Williams in 1942. Oldenburger 
states (13) that he worked out the design for second and third- 
order cases in 1944. Additional work on second-order systems 
has been reported by McDonald (3), Hopkin (4), Uttley and 
Hammond (5), Lathrop (6), West (7), Bushaw (8), Neiswander 
and MacNeal (9), and others. Research on third-order systems 
was recently described by Bogner (10), Silva (11), Preston (12), 
and Hopkin and Iwama (25). 

The possibility of extending the programmed control idea to 
higher-order systems has been recognized but no experimental 
work in this direction has been reported. Practical difficulties 
arise from the fact that a programmed controller for an nth-order 
system requires at least an (n — 1)-variable-function generator. 
Lack of suitable two-variable function generators has hampered 
even laboratory investigation of third-order systems. In the 
most extensive experimental investigation of a third-order sys- 
tem yet published (12), Preston used two single-variable diode 
function generators and an interpolating servomechanism to 
approximate the required function of two variables. 

After a brief review of the theory underlying a second-order
programmed control system, this paper describes the calculation
of the switching relation for a particular third-order system and
a laboratory realization, using an electro-optical two-variable
function generator, for use in an analog-computer investigation.
Results of computer tests with a variety of inputs are given
for this system as well as competitive linear and relay modes of
control.

3 Numbers in parentheses refer to the Bibliography at the end of
the paper.


SECOND-ORDER SYSTEM


In the positioning servomechanism considered in past studies,
it generally has been assumed that the output member was
completely characterized by its moment of inertia $J$ and that
the torque supplied by the motor ($T = \pm T_m$) could be reversed
instantaneously. Parasitic nonlinearities, such as backlash or
relay hysteresis, have ordinarily been neglected. With these
assumptions, the behavior of the system is described by the
second-order differential equation

$$J\ddot{c} = T = \pm T_m \qquad [1]$$

where $c$ is the controlled variable or output, and dots denote
differentiation with respect to time. For step inputs, Equation
[1] becomes, in terms of the error $e$,

$$J\ddot{e} = -T = \mp T_m \qquad [2]$$

Optimum step-function response is obtained by using maximum
accelerating torque until the error is reduced to half its
original value and maximum decelerating torque until the error
is reduced to zero, as shown in Fig. 1.


Fig. 1  Step Response of a Programmed Second-Order Servomechanism


To obtain optimum response for all step magnitudes, the
torque is determined continuously from the relation

$$T = T_m \, \operatorname{sgn}\left[\frac{2T_m}{J}\, e + \dot{e}\,|\dot{e}|\right] \qquad [3]$$

where sgn $x$ denotes $x/|x|$.
Torque reversal occurs on the switching curve defined by

$$\dot{e}\,|\dot{e}| = -\frac{2T_m}{J}\, e \qquad [4]$$

which divides the $e$-$\dot{e}$ phase plane into regions of positive and
negative torque as shown in Fig. 2(A). With this switching relation,
all system trajectories have the same general appearance and
coincide with the switching curve approaching the origin. The
response time for step inputs is

$$t_r = 2\sqrt{\frac{J e_0}{T_m}} \qquad [5]$$

where $e_0$ is the initial error or step magnitude (14).
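Fig. 2  Switching Curve for a Programmed Servomechanism and a
Possible Realization: (A) switching curve and trajectory in the
$e$-$\dot{e}$ phase plane, with regions of $+T$ and $-T$; (B) realization
using a differentiator, diode function generator, relay, and motor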


A possible physical realization of the switching relation is 
shown schematically in Fig. 2(B). As indicated, the sense of the 
torque is a function of both error and error rate, but the physical 
realization requires only a single-variable function generator in 
combination with linear elements. Negative output rate ($-\dot{c}$)
can be used instead of error rate without affecting the response 
for step inputs; this substitution does, however, result in a 
steady-state error for ramp inputs proportional to the square of 
the input rate. 

The position and shape of the switching curve can be changed 
to obtain an optimum step response in the face of continuous 
coulomb or viscous friction torques, neither of which affect the 
order of the differential equation (14). 
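
The switching rule described above can be checked numerically. The following is a minimal sketch, assuming the error dynamics $J\ddot{e} = -T$ and a torque chosen from the sign of $(2T_m/J)e + \dot{e}|\dot{e}|$; the function names, step size, and stopping tolerance are illustrative assumptions and are not taken from the paper.

```python
import math


def simulate_programmed_second_order(e0, J=1.0, Tm=1.0, dt=1e-4,
                                     tol=1e-3, t_max=100.0):
    """Integrate J*e'' = -T under the programmed switching rule.

    Returns the time at which both |e| and |e'| first fall below `tol`.
    All parameter names and defaults are illustrative assumptions.
    """
    e, e_dot, t = float(e0), 0.0, 0.0
    while t < t_max:
        if abs(e) < tol and abs(e_dot) < tol:
            break
        # Switching function: its sign selects full positive or negative torque.
        s = (2.0 * Tm / J) * e + e_dot * abs(e_dot)
        torque = Tm if s > 0.0 else -Tm
        # Semi-implicit Euler step of the error dynamics.
        e_dot += (-torque / J) * dt
        e += e_dot * dt
        t += dt
    return t


def predicted_response_time(e0, J=1.0, Tm=1.0):
    """Closed-form response time 2*sqrt(J*e0/Tm) for a step of size e0."""
    return 2.0 * math.sqrt(J * abs(e0) / Tm)


if __name__ == "__main__":
    for step in (0.25, 1.0, 4.0):
        print(step, simulate_programmed_second_order(step),
              predicted_response_time(step))
```

With a small step size the simulated settling time should approach the closed-form value; the residual difference comes from the discrete switching instant and the stopping tolerance.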


THIRD-ORDER SYSTEM


A finite time ordinarily will be required for the torque to go 
from one extreme value to the other. The variation of torque
with time may take several forms, all of which raise the order 
of the differential equation and require a more complicated con- 
troller. 

Bogner (10) and Preston (12) consider systems in which the 
torque varies exponentially with time, described by the equations 




which reduce to 


The time constant 7, might represent the field time constant of 
a d-c servomotor. 

In the system considered here, the torque is assumed to vary 
linearly with time between its extreme values, so that system 
operation is described by the equations 


otherwise 


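and $\tau$ is the time required for the torque to go from one extreme
value to the other. These equations, which may also be written

$$J\dddot{c} = \pm\frac{2T_m}{\tau}, \quad |T| < T_m \qquad [10]$$

$$J\ddot{c} = \pm T_m \quad \text{otherwise} \qquad [11]$$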
can be regarded as a description of a highly idealized steering 
system in which a rudder is driven between limit stops by a con- 
stant-speed motor, producing a torque on the steered body pro- 
portional to the rudder angle (1), (15). They also may be 
taken as a simplified description of a missile or aircraft roll-control 
system in which a rate-limited hydraulic servo is used to position 
the control surfaces (16-18). 

Step-Function Response. Figs. 3 and 4 show two character- 
istic step-function responses of this third-order system. For 
small step inputs, such that the initial error $e_0$ satisfies the relation

. . . [12]

the maximum error acceleration or torque is not required and the
response takes the form shown in Fig. 3. Under these conditions
the response time can be shown to be

. . . [13]

where $t_r$ is the time required for the error to become zero.

For larger step inputs, maximum error acceleration is reached 
and the response is as shown in Fig. 4. For very large step 
inputs, such that maximum torque is employed for almost the 
entire response, the response times are only slightly greater than 
the values given by Equation [5]. 
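
The small-step case can be worked out directly from the assumption that the torque rate is constant at $\pm 2T_m/\tau$. The sketch below is a derivation under that assumption, not a transcription of the printed Equations [12] and [13]: it supposes a rate pattern of $+$, $-$, $+$ with durations $t_a$, $2t_a$, $t_a$, which gives a displacement of $2jt_a^3$ (with $j = 2T_m/J\tau$) and a response time of $4t_a$. Function names are hypothetical.

```python
def integrate_jerk_segments(segments, state=(0.0, 0.0, 0.0)):
    """Exactly integrate piecewise-constant jerk.

    segments: iterable of (jerk, duration); state: (position, velocity,
    acceleration). Returns the final (position, velocity, acceleration).
    """
    p, v, a = state
    for jerk, h in segments:
        p += v * h + a * h * h / 2.0 + jerk * h ** 3 / 6.0
        v += a * h + jerk * h * h / 2.0
        a += jerk * h
    return p, v, a


def small_step_response_time(e0, J=1.0, Tm=1.0, tau=2.0):
    """Response time 4*t_a for a step small enough that |T| stays below Tm.

    Derived here from e0 = 2*j*t_a**3 with j = 2*Tm/(J*tau); raises
    ValueError if the step would require more than the torque limit.
    """
    jerk = 2.0 * Tm / (J * tau)
    t_a = (abs(e0) / (2.0 * jerk)) ** (1.0 / 3.0)
    if jerk * t_a > Tm / J + 1e-12:
        raise ValueError("step too large: torque limit would be reached")
    return 4.0 * t_a


if __name__ == "__main__":
    t_a = 0.5
    print(integrate_jerk_segments([(1.0, t_a), (-1.0, 2 * t_a), (1.0, t_a)]))
    print(small_step_response_time(0.25))
```

Under these assumptions the torque limit is just reached when $t_a = \tau/2$, that is, when $e_0 = T_m\tau^2/2J$; larger steps need the saturated segment shown in Fig. 4.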

Figs. 1 and 4 are drawn for the same $T_m$, $J$, and response time,
and show a smaller initial error for the third-order system; the
explanation is, of course, that the acceleration in the third-order
system is at most equal to that of the second-order system, and the
error rate is smaller at all times. The third-order system therefore
has a greater response time for the same initial error or step
magnitude. Nevertheless, the responses shown in Figs. 3 and
4 are the best possible with the prescribed values of $J$, $T_m$, and
$dT/dt$; there is no switching procedure which will return the
system to equilibrium in a shorter time. The response time can
be reduced only by decreasing $J$, increasing $T_m$ or $dT/dt$, or by
some other modification of the system (such as a brake added
only during the decelerating interval).

4 In reference (1), the switching procedure and mechanism are
developed for a system whose equation of motion also includes a
term corresponding to the restoring torque produced by aerodynamic
forces acting on the steered body itself.

Fig. 3  Response of a Programmed Third-Order System

Fig. 4  Step Response of a Programmed Third-Order System

The torque rate $dT/dt$ is reversed twice, as indicated by the
dashed lines in Figs. 3 and 4. Since these switching operations
occur at unique combinations of $e$, $\dot{e}$, and $\ddot{e}$, the switching
relation for this third-order system is expressed by a "switching
surface" in a three-dimensional phase space.⁵

Calculation of Switching Surface. The switching surface can
be obtained by associating with each point in the $e$-$\dot{e}$ plane the
value of error acceleration required if the system is executing an
optimum response. This process is best carried out by constructing
a number of optimum trajectories, starting from the
origin and working backward from the end of the trajectory in
negative time.

5 Reference (16) describes means for proper timing of only the first
reversal which result in near-optimum performance.

For ease of calculation, the values $T_m = 1$, $J = 1$, and $\tau = 2$
will be used, reducing Equations [10] and [11] to

$$\dddot{c} = \pm 1, \quad |T| < T_m \qquad [14]$$

$$\ddot{c} = \pm 1 \quad \text{otherwise} \qquad [15]$$

These substitutions are equivalent to using the nondimensional
variables

. . .

which convert Equations [10] and [11] into

$$c''' = \pm 1, \quad |T| < T_m$$

$$c'' = \pm 1 \quad \text{otherwise} \qquad [18]$$

where primes denote differentiation with respect to the
nondimensional time. If Equations [14] and [15] are used, the
discussion can be carried out in the original notation and later
generalized with the help of Equations [16] and [17].
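
As a check on this normalization, the following short derivation assumes the nondimensional variables $t' = 2t/\tau$ and $c' = 4Jc/(T_m\tau^2)$. These definitions are an assumption chosen to give unit coefficients (the time unit $\tau/2$ matches the one quoted later for the analog computer); they are not necessarily the printed Equations [16] and [17].

```latex
% Assumed (hypothetical) nondimensional variables:
%   t' = 2t/\tau, \qquad c' = \frac{4J}{T_m \tau^2}\, c
\begin{align*}
\frac{d^2 c'}{dt'^2}
  &= \frac{4J}{T_m\tau^2}\left(\frac{\tau}{2}\right)^{2}\ddot{c}
   = \frac{J\ddot{c}}{T_m} = \pm 1
   && \text{(torque saturated, } J\ddot{c} = \pm T_m\text{)} \\
\frac{d^3 c'}{dt'^3}
  &= \frac{4J}{T_m\tau^2}\left(\frac{\tau}{2}\right)^{3}\dddot{c}
   = \frac{J\tau\,\dddot{c}}{2T_m} = \pm 1
   && \text{(torque rate saturated, } J\dddot{c} = \pm 2T_m/\tau\text{)}
\end{align*}
```

Setting $T_m = 1$, $J = 1$, and $\tau = 2$ makes $t' = t$ and $c' = c$, which is why the same unit-coefficient equations result in either notation.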

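For step or ramp inputs, such that

$$\ddot{r} = 0$$

we have

$$\ddot{e} = -\ddot{c}, \qquad \dddot{e} = -\dddot{c}$$

so that Equations [14] and [15] can be written

$$\dddot{e} = \mp 1$$

$$\ddot{e} = \mp 1$$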
The switching relation can therefore be expressed in $e$-$\dot{e}$-$\ddot{e}$
co-ordinates and will serve for both step and ramp inputs.

As in the second-order system, the final switching operation
occurs on a system trajectory passing through the origin of the
phase space. The equations for one of two such trajectories are

$$\ddot{e} = -t = -T, \qquad \dot{e} = -\frac{t^2}{2}, \qquad e = -\frac{t^3}{6} \qquad [25]$$

with $t$ negative; $\ddot{e}$ and $e$ are positive, $T$ and $\dot{e}$ negative,
and the projection of this trajectory appears in the fourth
quadrant of the $e$-$\dot{e}$ plane. The other trajectory is symmetrical
with respect to the origin.

At the final switching point $t = -t_1$ and

$$\ddot{e} = t_1, \qquad \dot{e} = -\frac{t_1^2}{2}, \qquad e = \frac{t_1^3}{6} \qquad (t_1 > 0) \qquad [26]$$

Half of the switching surface consists of trajectories terminating
on those of Equation [25] at the co-ordinates given by
Equation [26]; the other half is obtained by symmetry considerations.
For $t < -t_1$, the equations are

$$\ddot{e} = t + 2t_1$$

$$\dot{e} = \frac{t^2}{2} + 2t_1 t + t_1^2 = \frac{1}{2}\left[(t + 2t_1)^2 - 2t_1^2\right]$$

$$e = \frac{t^3}{6} + t_1 t^2 + t_1^2 t + \frac{t_1^3}{3} = \frac{1}{6}\left\{(t + 2t_1)\left[(t + 2t_1)^2 - 6t_1^2\right] + 6t_1^3\right\} \qquad [27]$$

Letting $x = e$, $y = \dot{e}$, and $z = \ddot{e}$, eliminating $t$, and rearranging
in a form suitable for numerical computation, the equation of the
switching surface is found to be

. . . [28]

Although Equations [27] and [28] seemingly apply only when
$t_1$ is less than unity and therefore only when the step response is as
shown in Fig. 3, they actually have a wider application. In the
present system, the useful portion of the switching surface is
bounded by the planes $\ddot{e} = z = \pm 1$, since $J = T_m = 1$ and the
absolute value of $\ddot{e}$ cannot exceed unity. However, the shape
of this portion of the surface is unaffected by the presence of a
torque limit and would be unchanged if $T_m$ were infinite. Hence
Equation [28] can be used to compute the entire switching surface.
The planes $\ddot{e} = z = \pm 1$ are not to be considered part of
the switching surface.

The torque limit must, of course, be considered in computing
system trajectories; these trajectories are confined to the space
between and including the bounding planes and may have, in
some cases, segments in common with the intersection of the
switching surface and the bounding planes. Equations [14] and
[15] are used alternately as the system goes from saturation in
torque rate to saturation in torque magnitude, solutions being
fitted together at the transition points. Typical trajectories
computed in this manner are shown in Fig. 5, superimposed on
a plot of the switching surface. Since $\ddot{e} = -T$ for the numerical
values used here, the surface is plotted as lines of constant torque
rather than error acceleration.

Fig. 5  Switching Surface and Trajectories for the Programmed
Third-Order System
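
For numerical work, one half of the switching surface can be written in closed form by eliminating both $t$ and $t_1$ from the parametric trajectory equations (normalized values $J = T_m = 1$, $\tau = 2$). The sketch below uses the result $x = yz - z^3/3 + (z^2/2 - y)^{3/2}$, which is derived here from $z = t + 2t_1$ and $y = \tfrac{1}{2}(z^2 - 2t_1^2)$; it may differ in form from the printed Equation [28], and the function names are hypothetical.

```python
def trajectory_point(t, t1):
    """Point (x, y, z) = (e, e', e'') on a trajectory ending at the final
    switching point t = -t1, valid for t <= -t1 and t1 > 0."""
    z = t + 2.0 * t1
    y = 0.5 * (z * z - 2.0 * t1 * t1)
    x = (z * (z * z - 6.0 * t1 * t1) + 6.0 * t1 ** 3) / 6.0
    return x, y, z


def switching_surface_error(y, z):
    """Error x on the t1 > 0 half of the surface for given rate y and
    acceleration z (derived closed form, not the printed equation)."""
    t1_squared = 0.5 * z * z - y
    if t1_squared < 0.0:
        raise ValueError("point lies outside this half of the surface")
    return y * z - z ** 3 / 3.0 + t1_squared ** 1.5


def switching_surface_error_other_half(y, z):
    """Other half of the surface, obtained by symmetry about the origin."""
    return -switching_surface_error(-y, -z)


if __name__ == "__main__":
    x, y, z = trajectory_point(-3.0, 1.0)
    print(x, switching_surface_error(y, z))
```

At the final switching point itself ($t = -t_1$) the closed form reduces to $x = t_1^3/6$, in agreement with the co-ordinates of that point.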
Controller Realization. Equations [6] and [21] state that

$$\ddot{e} = -\ddot{c} = -\frac{T}{J} \qquad [29]$$

for step and ramp inputs, indicating that $\ddot{e} = z$ in Equation [28]
could be replaced by $-\ddot{c}$ (or $-T$, if $J = 1$) without detriment to
the step or ramp response of the system. In the hypothetical
steering or roll-control systems mentioned previously, the torque
was assumed proportional to a control-surface deflection which
could be measured by a position-sensing device (1). Alternatively
$\ddot{c}$ might be measured directly; this method is advantageous if the
system is subject to torque disturbances.


Fig. 6  Block Diagram of the Programmed Third-Order System
(controller, torque source, and controlled system)


If the torque is measured, the realization of the controller can
take the form shown in Fig. 6. The output of the two-variable
function generator is the torque computed from Equation [28],
$T_c = -Jz = f(e, \dot{e})$, which must exist for each combination of
$e$ and $\dot{e}$ on an optimum trajectory. The relay input

$$\epsilon = T_c - T \qquad [30]$$

the difference between the desired and actual torques, initiates
action bringing the two torques into correspondence. Since the
rate of change of torque called for on the optimum trajectories
does not exceed the capabilities of the system, the controller can
enforce continuous correspondence between the computed and
actual torque after they have been brought into agreement.
Operation of the system under general conditions is described
by

$$\dot{T} = \operatorname{sgn}\left[f(e, \dot{e}) - T\right], \quad |T| < 1 \qquad [31]$$

$$T = \pm 1 \quad \text{otherwise}$$

and can be understood with the help of Fig. 5. Assume that the
system starts at point A with $T = -\ddot{e} = 0$. Since this point
lies below the switching surface, the relay acts to make $dT/dt = +1$
and the system starts along the dotted path to point B. The
initial slope of the trajectory is zero because, under all conditions

$$\frac{d\dot{e}}{de} = \frac{\ddot{e}}{\dot{e}} \qquad [32]$$

and $\ddot{e} = 0$, $\dot{e} \neq 0$ at point A. At point B, the actual torque
agrees with the torque used in computing the optimum trajectory
($T = 1$); the two trajectories are tangent at this point by virtue
of Equation [32].

If the system is not subjected to further disturbances, the system
follows the previously computed optimum trajectory from
B to the origin. The first switching occurs at point D when the
trajectory reaches the switching surface; here $dT/dt$ becomes $-1$
and the torque starts from $+1$ toward $-1$. For this particular
set of initial conditions, the second switching occurs (at point F)
just as the torque reaches $-1$. For initial points farther from
the origin, the trajectory follows the curve $T = -1$ before reaching
F; for points closer to the origin, the second switching occurs
on the trajectory joining F and O.


SYSTEMS STUDIED


An analog-computer investigation of this system, using a pro- 
grammed controller as well as competitive linear and relay con- 
trollers, was conducted to examine the behavior for inputs which 
are not easily handled by analytical methods and to get an idea 
of the comparative performance of the various modes of control. 
Step-function response curves, obtained by analytical and graphi- 
cal methods, were used to check the performance of the computer. 

Programmed Controller. The electro-optical two-variable
function generator used for programmed control of this system
in these experiments employs a photoelectric tube to count pulses 
generated by the passage of lines on a rotating disk through a 
light beam. The light beam is positioned by a galvanometer 
excited by one input, while the duration of the counting interval 
is controlled by the other input. The number of lines counted 
per revolution is converted to an analog voltage which is a func- 
tion of the two inputs. The nature of the function generator af- 
fects the analog-computer configuration in several ways. 

The continuous variation of $T$ over the switching surface is replaced
by a stepwise variation using a number of steps which is
limited by space and resolution considerations. In these tests,
sixteen lines (corresponding to $T/T_m$ = +15/16, +13/16,
+11/16, ..., +1/16, −1/16, ..., −15/16) were used. Use of
an even number of lines gives a region of zero torque on either
side of the nominal zero-torque line, at the origin as well as in the
rest of the plane. Some inaccuracy in the determination of
switching points results from this approximation.

To make more efficient use of the available disk area and to
avoid resolution problems in the neighborhood of the origin, it
was considered advisable to express the switching surface in
terms of new co-ordinates. A 45-deg rotation was arbitrarily
adopted and $(e + \dot{e})$, $(e - \dot{e})$ were used as the inputs to the
function generator. The range of values used was
$-0.7 < (e + \dot{e}) < +0.7$ and $-1 < (e - \dot{e}) < +1$; operation was
thereby limited to step inputs producing an initial error of
approximately one unit.

The disk rotates at 1800 rpm in this function generator, so that
samples of the output may be considered to be available every
1/30 sec. The theoretical upper frequency limit on the output
of the function generator is therefore 15 cps, with a smaller limit
(3 to 4 cps) imposed by practical filtering problems. To avoid
possible difficulties introduced by filter lags, the analog computer
was operated with a time unit ($\tau/2$) of 10 sec.
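
The sampling figures quoted here follow from simple arithmetic, sketched below under the assumption of one output sample per disk revolution. The conversion of a frequency expressed per time unit into cycles per second is included to show why a 10-sec time unit keeps the signals well below the filter limit; the value 1.2 rad per time unit is used only as an illustration and is the undamped frequency quoted later for the linear controller. All function names are hypothetical.

```python
import math


def samples_per_second(disk_rpm, samples_per_revolution=1):
    """Output samples per second for a disk turning at disk_rpm."""
    return disk_rpm / 60.0 * samples_per_revolution


def nyquist_limit_cps(sample_rate):
    """Theoretical upper frequency limit: half the sampling rate."""
    return sample_rate / 2.0


def machine_frequency_cps(omega_per_time_unit, time_unit_seconds):
    """Convert an angular frequency in rad per time unit into cycles per
    second on a computer whose time unit lasts time_unit_seconds."""
    return omega_per_time_unit / time_unit_seconds / (2.0 * math.pi)


if __name__ == "__main__":
    rate = samples_per_second(1800)
    print(rate, nyquist_limit_cps(rate), machine_frequency_cps(1.2, 10.0))
```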

A photograph of the disk and a diagram of the computer con- 
figuration, based directly on Fig. 6, are given in Figs. 7 and 8. 

Linear Controller. A schematic diagram of a linear controller
for this system is shown in Fig. 9. For the same numerical
values as before and using acceleration feedback instead of
torque feedback, operation of this system is governed by the
equations

$$\dot{T} = K\left[e - K_r\dot{c} - K_a\ddot{c}\right] \qquad [33]$$

$$\dot{T} = \operatorname{sgn}\left[e - K_r\dot{c} - K_a\ddot{c}\right] \qquad [34]$$

$$T = \pm 1 \qquad [35]$$

Equation [33] applies only if

$$\left|e - K_r\dot{c} - K_a\ddot{c}\right| < \frac{1}{K} \qquad [36]$$

and

$$|T| < 1 \qquad [37]$$

while Equation [34] applies if the inequality of Equation [36] is
not satisfied, and Equation [35] applies if neither Equation [36]
nor Equation [37] is satisfied.

In the region of linear operation described by Equation [33],
the system transfer function is

$$\frac{C(s)}{R(s)} = \frac{K}{s^3 + K_a K s^2 + K_r K s + K} \qquad [38]$$

The gain $K$ influences the speed of response or band width of the
system and also fixes the size of the linear operating region.
With $K = 1$, the value which was arbitrarily adopted, a unit
step input with the system at rest will just produce saturation
in $\dddot{c}$.

Fig. 7  Disk Used in Function Generator

Fig. 8  Computer Configuration for Programmed Control System

Values of the rate and acceleration constants $K_r$ and $K_a$ were
selected, using data given by Graham and Lathrop (19), for
linear behavior which was a satisfactory compromise between
speed of response and degree of damping; the values used were
$K_a = 1.9$ and $K_r = 2.28$. With these values, the transfer
function becomes

$$\frac{C(s)}{R(s)} = \frac{1}{s^3 + 1.9s^2 + 2.28s + 1} = \frac{1}{(s + 0.7)(s^2 + 1.2s + 1.44)} \qquad [39]$$

Transient solutions thus consist of an exponential term with a
time constant 1/0.7 = 1.43 units and damped sine and cosine
terms with a damping ratio $\zeta = 0.5$ and an undamped resonant
frequency $\omega_n = 1.20$.

Fig. 9  Block Diagram of Third-Order System With Linear
or Relay Controller
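
The factored form of the transfer-function denominator and the quoted transient parameters can be verified with a few lines of arithmetic. This is a minimal check, with hypothetical helper names, that multiplies the two factors back together and reads the damping ratio and undamped frequency off the quadratic factor.

```python
import math


def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists, highest power first."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, a_i in enumerate(a):
        for k, b_k in enumerate(b):
            out[i + k] += a_i * b_k
    return out


def second_order_parameters(quadratic):
    """Return (undamped frequency, damping ratio) for s^2 + b*s + c."""
    _, b, c = quadratic
    omega_n = math.sqrt(c)
    return omega_n, b / (2.0 * omega_n)


if __name__ == "__main__":
    # (s + 0.7)(s^2 + 1.2 s + 1.44) should reproduce s^3 + 1.9 s^2 + 2.28 s + 1
    # to within the rounding of the printed factors.
    print(poly_mul([1.0, 0.7], [1.0, 1.2, 1.44]))
    print(second_order_parameters([1.0, 1.2, 1.44]))
    print(1.0 / 0.7)  # time constant of the real pole
```

The product has a constant term of 1.008 rather than exactly 1, which reflects rounding in the printed factors rather than an error in the factorization.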

Relay Controller. If the gain K becomes infinite, the region of
linear operation in the previous system vanishes and the
controller becomes a relay controller. Operation of the system is
now described by Equations [34] and [35]. For step inputs
when $\dot{c} = -\dot{e}$, Equation [34] may be written

$\dot{T} = \operatorname{sgn}\,[e + K_v\dot{e} - K_a T]$

$\phantom{\dot{T}} = \operatorname{sgn}\,K_a\left[\dfrac{1}{K_a}e + \dfrac{K_v}{K_a}\dot{e} - T\right]$

$\phantom{\dot{T}} = \operatorname{sgn}\,[T_c - T]$  [40]

where

$T_c = \dfrac{1}{K_a}e + \dfrac{K_v}{K_a}\dot{e}$  [41]

Comparison of Equation [40] with Equation [31] indicates that
the relay controller may be regarded not only as a linear controller
with infinite gain but also as an approximation to the programmed
controller, in which the curved surface is replaced by a plane.
This observation suggests a basis for selection of the constants
$K_a$ and $K_v$, which should be given different values than were used
in the linear controller.

From Equation [41], it is evident that the line $T_c = +1$
intersects the co-ordinate axes at

$e = K_a, \qquad \dot{e} = \dfrac{K_a}{K_v}$  [42]

Inspection of the switching surface in Fig. 5 suggests that
suitable values would be found in the range

$0.5 < e < 0.8$  [43]
$0.5 < K_a < 0.8, \qquad 1 < K_v < 2$  [44]

Fig. 10 Approximation of Switching Surface Used in Relay
Controller for Third-Order System

By experiment, the values $K_a = 0.7$ and $K_v = 1.4$ were chosen as
a compromise between speed of response and degree of damping
for both small and large step inputs. The resulting
approximation of the switching surface is shown in Fig. 10.
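The relay switching law can be illustrated with a short simulation. This is only a sketch under stated assumptions, not the authors' analog-computer setup: the nondimensional plant is taken as output acceleration equal to torque, the torque rate is limited to plus or minus one unit, the torque itself is limited to plus or minus one unit, and the names `k_a` and `k_v` (given the values 0.7 and 1.4) are hypothetical labels for the two relay constants. The computed torque is a plane in error and error rate, and the torque is driven toward it at the maximum rate.

```python
def simulate_relay(step, k_a=0.7, k_v=1.4, dt=0.001, t_end=20.0):
    """Euler simulation of a relay-controlled third-order system.

    Assumed model (nondimensional): c'' = T, T' = sgn(T_c - T),
    |T| <= 1, with T_c = (e + k_v * e') / k_a and e = step - c.
    Returns a list of (time, error) samples.
    """
    c, c_dot, torque = 0.0, 0.0, 0.0
    history = []
    for i in range(int(round(t_end / dt))):
        e = step - c
        e_dot = -c_dot                      # step input: e' = -c'
        t_c = (e + k_v * e_dot) / k_a       # planar switching surface
        diff = t_c - torque
        torque_rate = (diff > 0) - (diff < 0)   # sgn(T_c - T)
        # Rate-limited torque source with torque saturation
        torque = max(-1.0, min(1.0, torque + torque_rate * dt))
        c_dot += torque * dt
        c += c_dot * dt
        history.append(((i + 1) * dt, step - c))
    return history


if __name__ == "__main__":
    final_time, final_error = simulate_relay(0.1)[-1]
    print("final time:", final_time, "final error:", final_error)
```

With a small step the torque reaches the plane quickly and then chatters along it, so the error decays roughly as a damped second-order response; the fixed-step Euler integration leaves a small residual ripple of the order of the step size.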


RESULTS OF COMPUTER INVESTIGATION


A variety of inputs and disturbances was employed in the 
computer investigation. Results of these tests are summarized 
in the following sections. 

Step Inputs. Responses of the three systems for step inputs, 
traced from the original records, are shown in Fig. 11. Response 
time, arbitrarily defined as the time required for the error to be- 
come and remain less than 0.025 unit, is plotted for the three 
systems in Fig. 12. For the smallest step magnitude employed, 
the response time for the programmed system is less than one 
half the response time for the relay system and about one fourth 
as great as for the linear system. For large step magnitudes, the 
differences are less pronounced. 
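The response-time definition used here (the time required for the error to become and remain less than 0.025 unit) can be applied to a sampled record as follows. This is a minimal sketch; the function name and the sample data are hypothetical and are not taken from the computer records.

```python
def response_time(times, errors, tolerance=0.025):
    """Return the first sample time after which |error| stays below tolerance.

    Returns None if the error is outside the band at the final sample.
    """
    settled_at = None
    for t, e in zip(times, errors):
        if abs(e) < tolerance:
            if settled_at is None:
                settled_at = t      # candidate entry into the band
        else:
            settled_at = None       # left the band again; discard candidate
    return settled_at


if __name__ == "__main__":
    # Hypothetical decaying oscillation, for illustration only
    times = [0, 1, 2, 3, 4, 5]
    errors = [0.1, -0.04, 0.02, -0.03, 0.01, -0.005]
    print(response_time(times, errors))
```

In the sample record the error first enters the band at the third sample but leaves it again, so only the later entry counts; this is the distinction between "become" and "become and remain" in the definition.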

Sinusoidal Inputs. Samples from more extensive records of 
tests using sinusoidal inputs are shown in Fig. 13; numerical 
values of the error magnitudes can be obtained from Fig. 14. 


Fig. 11 Step Response of Third-Order System With Linear,
Relay, and Programmed Control

Fig. 12 Response Time for Step Inputs With Linear, Relay,
and Programmed Control
[Axes: response time, units (time to error ≤ 0.025 unit), versus
step magnitude, units]

Fig. 13 Response of Third-Order System to Sinusoidal Inputs
With Linear, Relay, and Programmed Control
[Input = 0.2 units (peak-to-peak); traces versus frequency,
cycles/unit time]

Fig. 14 Error Magnitude for a Sinusoidal Input of Fixed
Amplitude and Varying Frequency
[Axes: error (peak-to-peak), units, versus frequency, cycles/unit
time]


Figs. 13 and 14 indicate that the programmed system has a
very small error at low frequencies and that its band width,
defined arbitrarily as the frequency at which the error magnitude
is one quarter or one half of the input magnitude, is several
times greater than for the linear and relay systems. At high
frequencies, however, the error for the linear and relay systems
becomes equal to the input but the error magnitude for the
programmed system becomes four or five times as great as the input.
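The band-width definition above can be applied to tabulated error-versus-frequency data by interpolating for the frequency at which the error first reaches the chosen fraction of the input magnitude. The sketch below assumes linear interpolation between samples; the function name and the numbers in the example are hypothetical and are not read from Fig. 14.

```python
def band_width(freqs, errors, input_magnitude, fraction=0.5):
    """Frequency at which the error first rises to fraction * input_magnitude.

    freqs and errors are equal-length sequences sorted by frequency.
    Returns None if the threshold is never crossed from below.
    """
    threshold = fraction * input_magnitude
    for i in range(len(freqs) - 1):
        f0, f1 = freqs[i], freqs[i + 1]
        e0, e1 = errors[i], errors[i + 1]
        if e0 < threshold <= e1:
            # Linear interpolation between the two bracketing samples
            return f0 + (threshold - e0) * (f1 - f0) / (e1 - e0)
    return None


if __name__ == "__main__":
    # Hypothetical peak-to-peak error data for a 0.2-unit input
    freqs = [0.1, 0.2, 0.4]
    errors = [0.02, 0.06, 0.20]
    print(band_width(freqs, errors, 0.2, fraction=0.25))
    print(band_width(freqs, errors, 0.2, fraction=0.5))
```

Because the definition is arbitrary, the fraction is left as a parameter; comparing systems requires using the same fraction for each.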

The anomalous behavior of the programmed system at high 
input frequencies consists of fairly large torque and output 
motions having a fundamental frequency which is one third of 
the input frequency or less. As other records (not included 
here) show, this frequency is relatively constant as the input 
frequency is increased beyond the point at which this behavior is 
first observed. The error, being the difference of input and 
output, contains components at both the input and output fre- 
quencies. 

Since the response of the programmed system to sinusoidal 
inputs had not been calculated analytically in advance of the 
computer experiments, this anomalous behavior was not antici- 
pated. A limited number of checks indicated that the system 
equations are satisfied with reasonable accuracy in this mode of 
operation, so that no gross errors in the computer simulation 
could be held responsible. Subharmonic behavior in simpler 
nonlinear systems is discussed extensively in the literature on 
nonlinear mechanics [see, for example, Minorsky (20) or the more 
recent text by Stoker (21)] and isolated occurrences in feedback- 
control systems have been reported (22, 23), so that these results 
are not considered surprising. 

The input frequency at which the anomalous behavior is first 
observed is an inverse function of the input magnitude, increasing 
from about 0.5 cycles/unit time for the case shown to about 
0.75 cycles/unit time if the input magnitude is reduced to 0.025
units (peak-to-peak).


Fig. 15 Response of Third-Order System to a Semirandom
Input With Linear and Programmed Control
[Input = 0.15 units (peak-to-peak); medium-frequency and
high-frequency sections of record]


Random Inputs. Semirandom inputs, generated by a cam- 
driven linear potentiometer, gave results typified by the sections 
of record shown in Fig. 15. The behavior for this input is
somewhat similar to that for sinusoidal inputs; the error amplitude
for the programmed system is small when the input frequencies 
are small and is greater than the input amplitude when the 
input frequencies are large. 

Which system is better for this input depends on the point of 
view. If the input is regarded as noise which the system is 
supposed to ignore, the linear system is better; the similarity 
of the input and error records indicates that the output is essen- 
tially stationary. If the input is regarded as a signal which the 
system should follow, the programmed system does a better job 
when the input frequencies are low and a poorer job otherwise. 

A few tests with superimposed ramp and random inputs, or
sinusoids of different frequencies and amplitudes, showed that
the output of the programmed system can be predicted in a
qualitative way by a superposition of the responses to the
separate input components. This behavior might be explained by
the fact that a feedback system is more nearly linear than the
elements in the forward part of the loop (24). It suggests that
the programmed system might give very inferior performance
if subjected to a slowly varying input with superimposed
high-frequency noise components.

Impulse Torque Disturbances. The effects of impulse torque
disturbances which impart an initial velocity to the output
member, corresponding roughly to the effect of gusts on an
airplane or missile, were also examined. Sections of the
computer records are shown in Fig. 16, and data from the original
records are summarized in Figs. 17 and 18. The programmed
system has a three or four-to-one superiority in response time
and maximum output magnitude for the smallest disturbance
used and a smaller advantage for large disturbances.

Fig. 16 Response of Third-Order System to Impulse Torque
Disturbance With Linear, Relay, and Programmed Control
[Traces versus initial output velocity, units]

For the programmed system, these records also indicate the 
error which would result from application of a ramp input. This 
is not true for the other systems which used output rate rather 
than error rate in the controller. 

Sinusoidal Torque Disturbances. For step inputs, it is imma- 
terial in principle whether the error acceleration is obtained by 
differentiating the error or by measuring the output acceleration 
or torque. In practice, however, there is a considerable differ- 
ence between the three possibilities for other inputs. Double 
differentiation of the error signal with attendant noise problems 
(13) is avoided by use of output acceleration or torque; this also 
reduces problems associated with differentiation of step inputs. 

For sinusoidal or random torque disturbances (with r = 0), 
the output and error acceleration are interchangeable except for 
sign. As indicated by Fig. 6, the measured torque is only part 
of the total torque acting on the output member and is therefore 
not proportional to the output (or error) acceleration. If 
torque feedback is used, the effect of the disturbance is not felt 
immediately by the controller and rather large output motions 
result, as shown by Figs. 19 and 20. The superiority of the 
programmed system in this case results from the steepness of its 
switching surface near the phase-space origin, leading to rela- 
tively larger corrections for small error or error-rate magnitudes. 

Fig. 17 Response Time for Impulse Torque Disturbances
[Axes: response time, units (time to output ≤ 0.025 unit), versus
initial velocity, units]

Fig. 18 Maximum Error for Impulse Torque Disturbances
[Horizontal axis: initial velocity, units]

Fig. 19 Response of the Third-Order System to Sinusoidal
Torque Disturbances With Linear, Relay, and Programmed
Control
[Input = 0.35 units (peak-to-peak); traces versus frequency,
cycles/unit time]

With output acceleration feedback, as shown in Figs. 6 and 9,
the inner feedback loop operates to keep the net torque small.
As a result, the output motion is also small, as shown by Figs. 21
and 22. Since the inner loop in both the relay and programmed
systems is effectively a rate-limited first-order relay
servomechanism, it can be expected to operate almost perfectly for
sinusoidal torque disturbances whose rate of change does not exceed
its rate limit. If the disturbance is expressed as

$T_D = M \sin \omega t$  [45]

its rate of change is

$\dot{T}_D = M\omega \cos \omega t$  [46]

and a "critical frequency" $\omega_c$ can be defined as the frequency
which makes the maximum value of $\dot{T}_D$ equal to the maximum
possible rate of change of torque $\dot{T}_{max}$, or

$\omega_c = \dfrac{\dot{T}_{max}}{M}$  [47]


Observed critical frequencies are plotted for a number of dis- 
turbance magnitudes in Fig. 23, as well as a theoretical curve 
based on Equation [47]. Since the critical frequency is not 
easily identified experimentally, the agreement is perhaps satis- 
factory. 
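The critical-frequency definition can be turned into a short calculation. The sketch below follows the definition in the text (the peak rate of change of a sinusoidal disturbance of peak amplitude M equals the maximum torque rate) and converts the result to cycles per unit time. The default maximum torque rate of one unit and the function name are assumptions made for illustration; the peak amplitude is taken as half the peak-to-peak input.

```python
import math


def critical_frequency(peak_to_peak, max_torque_rate=1.0):
    """Critical frequency, in cycles per unit time, for T_D = M sin(wt).

    The peak rate of change of T_D is M * w, so the critical angular
    frequency is max_torque_rate / M, where M = peak_to_peak / 2.
    """
    amplitude = peak_to_peak / 2.0              # M, peak amplitude
    omega_c = max_torque_rate / amplitude       # radians per unit time
    return omega_c / (2.0 * math.pi)            # cycles per unit time


if __name__ == "__main__":
    for p2p in (0.2, 0.35, 0.6):
        print(p2p, critical_frequency(p2p))
```

The inverse dependence on amplitude means that halving the disturbance magnitude doubles the frequency up to which the inner loop can follow the disturbance.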


PARAMETER TOLERANCES 


The programmed controller for a particular controlled system 
is unique. Given the dynamic behavior of the output member 
and torque source, there is only one switching surface which 
minimizes the response time for all step magnitudes. (The 
controller can be realized in a variety of ways by taking different
combinations of $e$, $\dot{e}$, and $\ddot{e}$ or related quantities as the inputs
of the two-variable function generator.) This fact suggests that
the load parameters must be known with great accuracy and that 
the switching surface must be constructed accurately. Although 
no systematic study of permissible errors has been carried out, a 
few pertinent remarks can be made. 

The electro-optical function generator used in the
analog-computer work was put into service without thoroughly checking
its input-output relations against the theoretical switching
surface; limited checks showed that the over-all error in a particular
torque line, including the drafting errors in the disk, was
sometimes as great as half the distance between lines. On the other
hand, the observed response curves of Fig. 11 are in good
agreement with Fig. 3 and Equation [13].

Fig. 20 Response of Third-Order System to Sinusoidal Torque
Disturbances With Linear, Relay, and Programmed Control
Using Torque Feedback
[Input = 0.35 unit (peak-to-peak); axes: error (peak-to-peak),
units, versus frequency, cycles/unit time]

Fig. 21 Response of Third-Order System to Sinusoidal Torque
Disturbances With Linear, Relay, and Programmed Control
Using Acceleration Feedback
[Input = 0.35 units (peak-to-peak); traces versus frequency,
cycles/unit time]

Fig. 22 Response of Third-Order System to Sinusoidal Torque
Disturbances With Linear, Relay, and Programmed Control
Using Acceleration Feedback
[Input = 0.35 unit (peak-to-peak); horizontal axis: frequency,
cycles/unit time]

In making minor adjustments on the computer, it was noted 
that drifts at the function-generator inputs caused different 
responses to positive and negative inputs; this effect is caused by 
translation of the switching surface along the error and error 
rate axes. A drift at the function generator output likewise 
shifted the switching surface. Misadjustment of an amplifier 
gain following the function generator resulted in either oscillatory 
or overdamped responses. The system remained stable when 
these errors were small and the deterioration of the step response,
although noticeable, was not great.

In this connection, it may be noted that the relay controller is
a crude approximation to the programmed controller in which
the proper switching surface is replaced by a plane.
Misadjustments of the programmed system can be considered to distort
the switching surface; provided that the distorted surface is
intermediate between the two surfaces of Fig. 10, the response
should likewise be intermediate between the programmed and
relay systems.

In two experiments, a torque component proportional to the
output velocity was included; the viscous friction coefficient f
was 0.1 in one and 0.5 in the other. With no change in the
switching surface, only slight deterioration of the step response
was observed. The correct switching surface for an output member
having inertia and friction was computed and found to be generally
similar to the one used in these experiments, the region between
$T = +1$ and $T = -1$ being narrower and closer to the error rate
axis.

Fig. 23 Critical Frequency for Response of Programmed
System Using Acceleration Feedback to Sinusoidal Torque
Disturbances
[Axes: "critical frequency," cycles/unit time, versus input
(peak-to-peak), units; observed points and theoretical curve]

No experiments were conducted to determine the effect of out- 
put backlash, relay threshold, operating time or hysteresis, or 
other similar effects. 


CONCLUSIONS 


A programmed controller designed to minimize the response 
time of an idealized third-order system has been realized success- 
fully for analog-computer investigations using an electro-optical 
two-variable function generator. The expected superiority of 
this type of control for step inputs was observed, and it was also 
found that the system behaved well for sinusoidal inputs and 
disturbances of low frequency and/or amplitude. 

Some anomalous behavior, consisting of a type of subharmonic 
response, was discovered for sinusoidal inputs of large frequency 
and/or amplitude; the practical importance of this behavior 
could only be judged with reference to a specific situation. 

A limited amount of direct and indirect evidence indicates 
that the required parameter tolerances in a programmed system 
are not as severe as was previously supposed. The effect of vari- 
ous parasitic nonlinearities on system performance was not 
investigated, and the ranking of the various modes of control is 
therefore not conclusive. Additional investigation of these and 
other matters would be necessary if programmed control theory 
were to be reduced to practice. 


NOMENCLATURE 
The following nomenclature is used in the paper: 
c = controlled variable, or system output (e.g., airplane 
heading) 


c = nondimensional system output 



error 
viscous friction coefficient 
function of 
moment of inertia 
gain constant 
acceleration gain constant 
velocity gain constant 
peak amplitude of periodic torque disturbance 
magnitude of input step 
Laplace operator 
= torque applied to the controlled system 
= computed torque 
disturbance torque applied to system 
maximum value of torque 
time 
nondimensional time 
system response time 
time at which last switching occurs 
phase-space co-ordinates 
difference between computed and applied torque 
time required for torque to go from one extreme value
to the other 
time constant (e.g., field time-constant of a d-c servo- 
motor) 
angular frequency 




BIBLIOGRAPHY 


1 "Automatic Control System for Vehicles," by H. G. Doll,
Patent 2,463,362.

2 "Textbook of Servomechanisms," by J. C. West, English
Universities Press Ltd., London, England, 1953, p. 219.

3 "Nonlinear Techniques for Improving Servo Performance,"
by D. McDonald, Proceedings NEC, vol. 6, 1950, pp. 400-421.

4 "A Phase-Plane Approach to the Compensation of Saturating
Servomechanisms," by A. M. Hopkin, Trans. AIEE, vol. 70, 1951,
pp. 631-639.

5 "The Stabilization of On-Off Controlled Servomechanisms,"
by A. M. Uttley and P. M. Hammond, "Automatic and Manual
Control," Academic Press, New York, N. Y., 1952, pp. 285-307.

6 "A Topological and Analog Computer Study of Certain
Servomechanisms Employing Nonlinear Electronic Components," by
R. C. Lathrop, PhD thesis, University of Wisconsin, 1951.

7 British Patent 29011, J. C. West, 1951.

8 Report No. 469, by D. W. Bushaw, Experimental Towing
Tank, Stevens Institute of Technology, Hoboken, N. J., 1953.

9 "Optimization of Nonlinear Control Systems by Means of
Nonlinear Feedback," by R. S. Neiswander and R. H. MacNeal,
Trans. AIEE, vol. 72, part II, 1953, pp. 262-272.

10 "An Investigation of the Switching Criteria for Higher Order
Contactor Servomechanisms," by I. Bogner and L. F. Kazda,
Trans. AIEE, vol. 73, part II, 1954, p. 118.

11 "Nonlinear Optimization of Relay Servomechanisms," by
L. M. Silva, Technical Report, Electronics Research Laboratory,
University of California, Berkeley, Calif., April 15, 1954.

12 "Nonlinear Control of a Saturating 3rd-Order
Servomechanism," by J. L. Preston, Technical Memorandum No. 6897-TM-14,
Contract NOrd 11799, Servomechanisms Laboratory, M.I.T.,
Cambridge, Mass., May 24, 1954.

13 Discussion of Reference 10, R. Oldenburger.

14 "Effects of Friction in an Optimum Relay Servomechanism,"
by T. M. Stout, Trans. AIEE, vol. 72, part II, 1953, pp. 329-336.

15 "Graphical Solution of Some Automatic Control Problems
Involving Saturation Effects With Application to Yaw Dampers for
Aircraft," by W. H. Phillips, NACA Technical Note 3034, October,
1953.

16 "Use of Nonlinearities to Compensate for the Effects of a
Rate-Limited Servo on the Response of an Automatically Controlled
Aircraft," by S. F. Schmidt and W. C. Triplett, NACA Technical
Note 3387, January, 1955.

17 "Sinusoidal Techniques Applied to Nonlinear Feedback
Systems," by E. C. Johnson, Symposium on Nonlinear Circuit
Analysis, Polytechnic Institute of Brooklyn, Brooklyn, N. Y., 1953,
pp. 258-273.

18 "Analysis and Design Principles of Second and Higher-Order
Saturating Servomechanisms," by R. E. Kalman, AIEE Technical
Paper 55-551, 1955.

19 "Synthesis of 'Optimum' Transient Response: Criteria and
Standard Forms," by D. Graham and R. C. Lathrop, Trans. AIEE,
vol. 72, part II, 1953, pp. 273-286.

20 "Introduction to Nonlinear Mechanics," by N. Minorsky,
Edwards Brothers, Ann Arbor, Mich., 1947.

21 "Nonlinear Vibrations," by J. J. Stoker, Interscience
Publishers, New York, N. Y., 1950.

22 "A Differential-Analyzer Study of Certain Nonlinearly
Damped Servomechanisms," by R. R. Caldwell and V. C. Rideout,
Trans. AIEE, vol. 72, part II, 1953, pp. 165-170.

23 "The Mechanism of Subharmonic Generation in a Feedback
System," by J. C. West and J. L. Douce, Proceedings of the Institute
of Electrical Engineers, vol. 102, 1955, part B, p. 569.

24 "Network Analysis and Feedback Amplifier Design," by H. W.
Bode, D. Van Nostrand Company, Inc., New York, N. Y., 1945.

25 "A Study of a Predictor-Type Air-Frame Controller Designed
by Phase-Space Analysis," by A. M. Hopkin and M. Iwama, AIEE
Paper 56-198.


Discussion 


RUFUS OLDENBURGER.⁶ The Society is fortunate to have a
paper by the author, who appears to be the first person to use 
switching functions for optimum transient response, and the co- 
author, who has contributed much to the science of the field. 
The work of these men, as well as that of other scientists in the 
servomechanism field, has been of inestimable value to both 
servomechanism and regulator engineers. Although the servo- 
mechanism specialist focuses his attention to a large extent 
on the problem of making an output follow an input, whereas the 
regulator expert concentrates on keeping the controlled quantity 
constant or nearly constant in value, the servomechanism special- 
ist must concern himself with load disturbances, and the regulator 
expert with changes in the set point of the controller. The 
automatic control field owes much to the pioneering effort of the 
servomechanism scientist who has often led the way. 

The optimum transients found by the writer in July, 1944, when 
he began working on the approach of the authors’ paper, were for 
the system 


Je’ + at @ 
where J, a, and @ are constants and 
$|m'| \le k_1$  [50]

for a constant $k_1$. Here c is the rpm of an airplane engine and m
is propeller-blade angle. The primes denote derivatives with re- 
spect to time. The relation [50] means that the propeller servo 
speed is limited. Equation [49] is a torque equation, where J is 
the moment of inertia of the propeller shaft and connected parts 
and ac is a damping term. The system of Equations [49] and 
[50] is identical with that of the authors described by Equation 
[1], except for the damping term given by ac; i.e., Equation [49] 
corresponds to what the servomechanism experts term a “‘second- 
order’ system, and which would then be written as 


where m' is bounded. The writer showed that the best transients
are obtained by letting the propeller servo travel at maximum
speed or zero speed at all times; i.e., we have m' equal to $\pm k_1$
or zero. In the normal servomechanism equivalent m' would be
the torque T which takes on a maximum, minimum, or zero value
at all times.

⁶ Director of Research, Woodward Governor Company, Rockford,
Ill. Mem. ASME.

The switching surface given by Equation [28] of the paper for
the system with Equation [23] is obtained by eliminating the
variables $t$ and $t_1$ from Equations [27]. In order to reach
equilibrium along a trajectory as described by Equations [25] and
[27] it is necessary that one be on the switching surface in phase
space. However, to reach equilibrium in an optimum manner,
one must follow the correct curves on this surface, as traced back
from the equilibrium point through Equations [25] and [27].
Thus if one has the initial conditions 


e=e’’ = Q, e’ = 


and one assumes that one has a transient where | 7’ | < 1, then 


equilibrium is attained with two switchings involving the func- 
tions 


=: =e’ + 2 


for the absquare {e’’} of e’’; i.e. 

{e’’} 
The function 2; comes directly from Equation [28], whereas 2, is 
obtained by eliminating ¢, from the first and second Equations 


{26}. To obtain the optimum transient we use e’’’ = —1 for the 
first phase (see Fig. 24 of this discussion). 


Fic. 24 


When 2; becomes zero we switch toe’’’ = +1. Finally, when 
we reach 2, = 0, we let e’’’ = —1 until equilibrium with e = e’ 
=e’ = Qisreached. Then we switch toe’’’ = 0. The use of two 
switching functions is implied in the paper; i.e., the first switch 
puts one on the switching surface, and the second switch brings 
one from the Trajectory [27] to Trajectory [25]. However, once 
one is on the switching surface one is automatically led by the 
authors’ method along an optimum trajectory to equilibrium. 
The writer and the authors are in complete agreement but are 
looking at the same thing from slightly different points of view. 

If we introduce 


6 
obtained by eliminating ¢ from the first and third Equations [25], 
then 2, = L; = Oimplies that 2; = 0. The Trajectory [25] is one 
for which 2, = 2; = 0. 




The case just discussed is a limiting one for the single-lag sys- 
tems (called “third order’’ by the authors) in the writer’s ASME 
paper 56—IRD-13 entitled “Optimum Nonlinear Control.’’ The 
writer has not carried through the work, but thinks that control 
can here be based on the sign of one function as he has proved 
holds for single-lag systems. 


V. C. RIDEOUT⁷ AND M. G. SPOONER.⁷ The authors are to be
congratulated on this thorough study of the third-order nonlinear 
servomechanism, and on their able combination of analytical and 
computer methods of attack upon this important problem. 

The design of feedback systems in which nonlinearities are
purposely exploited is still in its infancy, however. One evidence
of this is the absence of compact criteria for comparison of such
systems with one another and with linear systems. The authors,
like other investigators,⁸ ⁹ have emphasized response to unit
step-functions, although some attention also has been given to other
wave forms and to rather vaguely defined random inputs. Their
results bear out the well-known fact that nonlinear-system
response to one type of input (or disturbance) may give no clue as
to the response to another type of input.

In the rather common cases where inputs are statistically
defined, and cannot be expressed in terms of semi-isolated
step-functions or quasi-steady sinusoids, it would appear that
correlation functions, which yield information as to mean-square error
and delay, might serve more succinctly to state the degree of
advantage of torque-switching servos over the corresponding linear
servo. Some efforts¹⁰ along these lines have been very useful in
determination of equivalent linear systems, and at Wisconsin we
have attempted¹¹ to show how this approach might be used to
yield compact data on nonlinear-system performance. Further
work along these lines is being pursued with the aid of a
high-speed analog correlator.¹² It would be interesting to know if the
authors have considered the use of such methods of study of their
third-order nonlinear systems.


J. C. WEST.¹³ The authors are to be congratulated on making
this bold attempt to utilize a nonlinear function generator which 
is a function of two variables in a closed loop system. 

There are two factors of great importance: 


1 The operational results are not sensitive to deviations of 
the practical two variable function generator from the theoretical 
values. 

2 Although based on step or transient-response optimization 
there is improved performance in other respects, i.e., for stochas- 
tic inputs and for sinusoidal signals of varying frequency. 


It is interesting also as a confirmation of some theoretical work
by Bogner and Kazda¹⁴ which shows that for an nth order
differential equation the system must take n-1 reversals to bring
error and derivatives of error to zero in the smallest time.

⁷ College of Engineering, University of Wisconsin, Madison, Wis.

⁸ "Research in Nonlinear Mechanics as Applied to
Servomechanisms," by P. E. Kendall and Irving Bogner, WADC Technical
Report 53-521.

⁹ "A Differential-Analyzer Study of Certain Nonlinearly Damped
Servomechanisms," by R. R. Caldwell and V. C. Rideout, Trans.
AIEE, vol. 72, part II, 1953, pp. 165-170.

¹⁰ "Nonlinear Servomechanisms With Random Inputs," by R. C.
Booton, Jr., DACL Report No. 70, Massachusetts Institute of
Technology, Cambridge, Mass., August 20, 1953.

¹¹ "The Use of Correlation Techniques in the Study of
Servomechanisms," by T. M. Burford, V. C. Rideout, and D. S. Sather,
Journal of the British Institution of Radio Engineers, vol. 15, May,
1955, pp. 249-257.

¹² "A High-Speed Correlator," by H. Bell and V. C. Rideout,
Transactions of the Institute of Radio Engineers (Electronic
Computer Group), EC-3, No. 2, June, 1954, p. 30.

¹³ Professor, Department of Electrical Engineering, Manchester
University, Manchester, England.

Work with a similar aim but approached in a different manner
has been carried out by Coales and Noton¹⁵ who have used
prediction and a fast analog simulator built into the system in order
to determine correct switching times.

The anomalous behavior of the system at high frequencies is
of interest since the writer has made a study of these subharmonic
phenomena. It has been found in general that for systems where
the third subharmonic can be obtained it is also possible to obtain
higher-order subharmonics¹⁶ (up to the 11th) by suitably
increasing the input-driving frequency and increasing the
amplitude. The range of amplitude and frequency over which the
phenomenon is exhibited gets progressively smaller the higher
the order of the subharmonic.

The writer should also like to ask if the authors found any 
“jump” phenomena in obtaining the fundamental frequency re- 
sponse, and if any anomalous behavior was observed under 
random testing conditions. 


AuTHors’ CLOSURE 


It is gratifying to the authors to find that Dr. Oldenburger
and Dr. West are in agreement with them on the number of
switches required to execute an optimum transient in a
third-order system. A fairly general proof for the number of
switching operations required, in agreement with previous assertions,
has recently been given in a paper by Bellman, et al.¹⁸

¹⁴ Reference (10) of the paper.

¹⁵ "An On-Off Servomechanism With Predicted Change-Over," by
J. F. Coales and A. R. M. Noton, Proceedings of the Institute of
Electrical Engineers, 1956, Paper No. M.1895.

¹⁶ Reference (23) of the paper.

¹⁷ "The Dual Input Describing Function and Its Use in the
Analysis of Non-Linear Feedback Systems," by J. C. West, J. L. Douce,
and R. K. Livesley, Proceedings of the Institute of Electrical
Engineers, vol. 103, 1956, part B, Paper No. M.1877.

The authors, in their approach to this study, were initially 
most concerned with the criterion of response time following a 
step change of input or torque, since the design philosophy in 
this mode of nonlinear control is explicitly such as to minimize 
this response time. The authors did not make use of correlation 
functions to characterize the system performance; within the 
limited scope of the computer study reported on here, which was 
undertaken without reference to many of the considerations 
affecting the design and performance of a real system, the use 
of such sophisticated criteria as correlation functions did not 
appear to be justified. Furthermore, this study encourages the 
opinion that optimization of a nonlinear control with respect to 
one criterion does not necessarily mean that the control is 
“optimum” with respect to all other criteria. 

Dr. West appears to have made a more thorough study of 
subharmonic phenomena than did the authors; jump phenom- 
were not observed when the fundamental frequency 
The ‘anomalous’? behavior observed 


ena 
response was measured. 
under quasi-random conditions is illustrated in Fig. 15: As with 
sinusoidal inputs, when the rate of change of input exceeds the 


anomalous behavior may be 
The term anomalous merely implies that the theory 
of these phenomena is not completely understood. 

Now that nonlinear control systems of this type have been 
examined in some detail and their principal characteristics are 
better understood, the authors hope and expect that their sphere 
of practical application will develop rapidly. 


capacity of the torque source, 
observed. 


18 “On the ‘Bang-Bang’ Control Problem,” by R. Bellman, I.
Glicksberg, and O. Gross, Quarterly of Applied Mathematics, vol. 14,
April, 1956, pp. 11-18.




Optimum Nonlinear Control 


By RUFUS OLDENBURGER,¹ ROCKFORD, ILL.


This paper is concerned with the response of a controlled 
system after an initiating disturbance has died out. 
Such a transient is obtained, for example, when the load 
on a prime mover is suddenly rejected or the speed setting 
of an engine governor is instantly switched to a new value. 
It is assumed that the rate of change of the controlling 
variable with respect to time is bounded, and that the 
maximum rate of change can be obtained arbitrarily. 
Thus the speed of a hydraulic governor servo is limited. 
The best return to equilibrium (minimum over or under- 
swing, minimum duration of the transient, and so on) 
can be obtained under rather general conditions by having 
the servo or its equivalent travel only at maximum or zero 
speed. Control functions exist which give the optimum 
transients. These functions are nonlinear. The results 
of theoretical studies to enable the control designer to ob- 
tain optimum or nearly optimum transients are given here 
along with practical compromises. All results have been 
verified in the laboratory with physical devices (governors) 
of various kinds and automatically controlled systems. 


INTRODUCTION 


A CONTROL system must be stable. It is natural that much
concern in the past has been with the problem of stability.
When a controller gives unstable performance
the customer will simply not buy it. The use of derivative
played an enormous role in the solution of the problem of sta- 
bility. This problem is normally a linear one. Recently atten- 
tion has been focusing on the problem of the quality of the con- 
trol. Substantial improvement can be obtained in many areas 
by the use of nonlinear control. 

To designate the class of controlled systems treated in this 
paper, the author will sometimes restrict himself to speed- 
governed prime movers. The results hold equally well for the 
control of temperature and other variables.

Consider a physical system with a single controlled variable. 
We consider the response of this system to a given disturbance, 
where after the disturbance dies out the system attains an 
equilibrium state S. We let m denote the deviation in the value 
of the controlled variable from its value in the state S. Simi- 
larly, let c be the deviation in the value of the controlling varia- 
ble from the value it has for the state S. For the state S we 
then have m = c = 0. Let P(D) and Q(D) be polynomials in 
the derivative D with respect to time where the constant terms 
are different from zero. Let $e^{-\tau_1 D}$ denote the dead-time operator,
and let g(c) be a monotonic nondecreasing function of c [this
means that as c increases, the function g(c) does not decrease].
By definition, given a function c(t) of time t

$$e^{-\tau_1 D}\, c(t) = c(t - \tau_1)$$

1 Director of Research, Woodward Governor Company. Mem. 
ASME. 

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters, January
12, 1956. Paper No. 56-IRD-13.


Let f(m) be a monotonic nondecreasing function of m, and let 
the prime on m’ denote the derivative dm/dt of m with respect to 
t. For systems in a single controlled variable the nonlinear 
theory developed by the author is devoted to those with the 
differential equation 


$$m' + f(m) = \frac{P(D)\,e^{-\tau_1 D}}{Q(D)}\,g(c) - L \qquad [I]$$

where L is a given (forcing) function of time. The polynomial
P(D) corresponds to leads and the polynomial Q(D) to lags.
Equation [I] may be considered to be a torque equation where
m', f(m), $[P(D)e^{-\tau_1 D}/Q(D)]\,g(c)$, and L correspond to the inertial,
damping, driving, and load torques, respectively, m being the 
revolutions per minute (rpm) of the prime mover and c the servo 
(throttle or equivalent) position. The coefficients in Q(D) are 
assumed to be positive, since otherwise Equation [I] represents 
an unstable system. All of the applications encountered by the 
author in the speed-governor field are covered, at least to a first 
approximation, by Equation [I]. In most of the theory relating 
to Equation [I] the polynomial P(D) is assumed to be a nonzero 
constant. 

We suppose that the controlled system is at a given instant 
in a nonequilibrium state, and that the system thereafter reaches 
equilibrium while L is constant. It is assumed that c’ is bounded. 
In the hydraulic speed-governor field this means the servo speed 
is bounded, which is always the case. It is also assumed that 
c’ can arbitrarily be made to attain its maximum or minimum 
value, which is true for practical purposes in the case of hydraulic 
servos. Under rather general conditions the best transients in 
every sense (minimum over or underswing, minimum duration, 
etc.), i.e., optimum control, can be obtained by having c’ take on
only its maximum and minimum values and zero. In the case 
of speed governors this means that the servo should be per- 
mitted to travel at full speed or zero only. 

Optimum control for systems with Equation [I] is not at- 
tained by having the servo travel at full or zero speed in the case 
where P(D) is different from a constant. However, if the linear 
term in P(D) is missing, satisfactory, or nearly optimum tran- 
sients, may often be attained by having the servo travel at full or 
zero speed. When water hammer is accurately taken into ac- 
count in the control of hydroturbines the polynomial P(D) con- 
tains a linear term. Optimum transients are also not attained for 
a class of initial conditions unlikely to occur in practice, where 
lags are involved and m is approaching its equilibrium value at a 
numerically large rate m’. 

Control functions depending on m and on the results of mathe- 
matical operations performed on m can be employed to yield 
optimum transients. These control functions are nonlinear, 
and are used to determine the values assigned to c’ during a 
transient. Whether or not nonlinear control will be employed in 
specific cases depends on performance, engineering, and economic 
considerations. 

In July, 1944, the author wished to determine how far the 
Woodward governors in use on the Hamilton Standard airplane 
propellers deviated from ideal ones. He immediately found 
that the best transients could be obtained by making the servo 
travel at full speed at all times. That is, for a sudden throttle 
burst or other instantaneous disturbance the servo should travel 
at full speed in one direction, then change at the correct instant
to full speed in the other direction, and so on. The author ob- 
tained the ideal transients and established that the performance 
of the governors then in use was not too far from ideal, and that 
they were operating very satisfactorily. 

Later the author developed the general theory of control where 
during transients the servo travels at full speed a maximum 
amount of time, the direction of motion being controlled by func- 
tions chosen to give optimum performance. The theory was 
checked in the laboratory in all of its details by the construction 
of controllers operating according to the desired functions. 

We shall introduce the term absquare of a number x to mean
the quantity |x|x. Here |x| designates the absolute value of x.
We denote the absquare of x by {x}. Thus the absquare of x
is the signed square of x; i.e., {x} = x² when x is positive, and
−x² when x is negative. For the simplest case of a controlled
system, with no lags, the control function for optimum per- 
formance involves the absquare of the derivative m’ of the 
deviation m of the controlled quantity. When the controlled 
quantity is prime-mover rpm, the quantity m’ is prime-mover 
acceleration. A portion of a transient during which c’ is constant
is termed a phase. For every system covered by Equation
[I] with P(D) equal to a constant, no dead time ($\tau_1 = 0$), and
f(m) neglected [neglecting f(m) is justified as theory shows], the
absquare also occurs in the control function whose vanishing 
determines that c’ should change sign at the start of the last phase 
before m reaches equilibrium (see Appendix 2). The same is 
true when the dead time is included and large load rejections 
are treated (Appendix 1). Because of these and other con- 
siderations it appears to the author that the absquare of the 
derivative m’ may be the next element to be often, if not gener- 
ally, added to control functions. 
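
As a minimal illustration of the absquare, the signed square |x|x of a number x, the following Python sketch may be helpful. The function name `absquare` is chosen here for illustration only and does not come from the paper.

```python
def absquare(x: float) -> float:
    """Return the absquare {x} = |x| * x, i.e. the signed square of x."""
    return abs(x) * x


# Unlike an ordinary square, the absquare keeps the sign of its argument.
for value in (3.0, -3.0, 0.0):
    print(value, absquare(value))
```

The only design point is that the sign of the argument is preserved, which is what lets a single expression cover approach to equilibrium from either side.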

In so far as we know the first automatic control was the Watt 
governor. This control was proportional, then integral and de- 
rivative were added to controllers, and it now appears that the 
absquare may be employed. Finding that f(m) could be neglected 
was quite important, since neglecting f(m) made an enormous 
simplification in the theory. 

In 1950 Donald McDonald (1)² published the control function
which yields optimum transients in the simplest case of a system 
with no lags or leads. In this paper he used a heuristic phase- 
plane argument. In 1952 he published another paper (2) on the 
subject. This was a phase-plane study of torque saturation. 
In 1951 there appeared a paper by A. M. Hopkin (3) on the re- 
sults of an empirical phase-plane analysis of a servo with limiting 
or saturation. A computer study of the subject was made by 
Richard C. Lathrop as part of his PhD thesis (4) the same year. 
At the Cranfield Conference in England in 1951, A. M. Uttley 
and P. H. Hammond (5) treated the switching of an on-off servo, 
employing the absquare. In a PhD thesis in 1953 Donald 
Wayne Bushaw (6) made a topological study of transients for 
the case of torque saturation controlled by switching functions, in 
particular, linear ones. Lawrence M. Silva studied switching func- 
tions to enable the servo to travel at full speed, and used an energy 
approach to find these functions (7 to 9). Irving Bogner and Louis
F. Kazda (10) investigated the switching criteria for higher-order 
servomechanisms using the phase-plane approach, and showed 
that in general the number of switchings for optimum transients 
increases by one as the order of the controlled system goes up by 
one. Further work on higher-order servomechanisms was done 
by S. S. L. Chang (12) and R. E. Kalman (13). Servomechanisms
with friction were treated by Louis F. Kazda (14) and T. M.
Stout (15). 

All of these contributions touch on the nonlinear theory devel- 


2 Numbers in parentheses refer to the Bibliography at the end of 
the paper. 




oped by the author which for proprietary reasons has been kept 
confidential. The mathematics of this theory is far beyond the 
limits of this paper. However, for the benefit of the reader, the 
complete fundamental theory is given for the case of the simplest 
controlled system. It is proved that a best transient exists for 
any set of initial conditions, and that a control function $\Sigma$ exists,
such that m is brought to equilibrium along the best transient
by having c’ take on its minimum value when $\Sigma$ is negative, its
maximum when $\Sigma$ is positive, and zero when $\Sigma$ is zero. It is
hoped that this treatment will give the reader confidence in the 
approach, and indicate the questions that must be answered to 
establish the theory for more complicated systems. Without a 
rigorous theory one cannot be sure that one has the right control 
functions and that a competitor will not do better. For more 
complicated systems the mathematical results are given without 
proof. The author feels that from these considerations the 
reader can obtain a fairly complete picture of why and where the 
nonlinear approach improves the quality of the control, and 
where the use of nonlinear terms is justified. Linear approxi- 
mations are often satisfactory. The reader also should learn 
what compromises of the theory should be made in practice to 
preserve performance but reduce costs. 

Controls in present use involve nonlinearities. Thus in many 
controllers c’ takes on its maximum or minimum value when one 
is outside of a “control band” based on the value of m. Instead 
of these “unintentional” nonlinearities the correct ones for de- 
sired optimum performance should be built into the control. 

It appears to the author that the development of new controls 
will proceed as follows: The correct control functions for opti- 
mum performance will be established by mathematical theory. 
The inherent complexity of this theory indicates that this will 
require the services of expert research mathematicians. After
the correct control functions or reasonable compromises have 
been established the development engineer will incorporate them 
into practical devices. 


CONTROL


We shall first develop the theory for the simplest case and then
proceed to more complicated problems. The simplest system is
one where

$$m' = K_1 c \qquad [1]$$

for the controlled quantity m, controlling variable c, and constant
$K_1$. As noted, m' is the derivative given by

$$m' = \frac{dm}{dt} \qquad [2]$$

In practice the rate of change c' of the controlling quantity is
limited; i.e.

$$|c'| \le K_2 \qquad [3]$$

for a constant $K_2$ and absolute value |c'| of c'.
As an example we may have 


m = engine rpm 
c = throttle position 


for a prime mover, where these quantities are deviations from 
equilibrium values. We use the term “throttle” in a general 
sense to denote rack position on a diesel engine, gate co-ordinate 
for a hydraulic turbine, fuel-valve position for a gas turbine, 
steam-valve position for a steam turbine, or throttle position for 
a gasoline engine. Equation [1] now says that the engine acceler- 
ation m’ is proportional to throttle position (deviation from equi- 
librium). When the engine speed is subject to automatic control 




the servo is connected to the throttle. It is therefore convenient 


to take 


c = servo position 


whence Equation [1] says that the engine acceleration is pro- 
portional to servo position. 

Relation [3] says that the servo speed c' is bounded by $K_2$.

For example, in the case of the General Motors diesel GM71,
driving a direct-connected alternator, with 1-in. servo movement
from no-load to full-load, Equation [1] is (approximately)

$$m' = 600c \qquad [4]$$

Thus if the servo is at the no-load position c = 0 and if the servo
is moved suddenly to the full-load position c = 1, the engine
accelerates at 600 rpm/sec. A typical Relation [3] for this case
is

$$|c'| \le 10 \qquad [5]$$

which means that the servo speed is limited to 10 ips; i.e., the
servo will go from no-load to full-load in 1/10 sec when traveling
at maximum speed.

The condition that the servo speed is bounded is always true 
in practice. The power a servo can consume is limited and hence 
the servo speed is bounded. Sometimes the servo speed is 
limited by other considerations, such as to prevent excessive 
water hammer in hydro-power installations. In the case of 
aircraft-propeller governors, if the propeller changes pitch at too 
fast a rate, the passengers and pilot experience excessive dis- 
comfort. In the case of hydraulic speed governors the servo 
normally can be operated at full speed at will during a transient. 
This is also true, at least approximately, for other types of 
controls. 

The problem of optimum transients is: Given any initial condition

$$t = 0, \qquad m = m_0, \qquad m' = m_0' \qquad [6]$$

how must the servo c be moved so that equilibrium

$$m = 0$$

will be reached in a minimum time with minimum overswing or
underswing, and minimum area between the m-curve and the
m = 0 axis? We shall show that for each set of initial conditions
there is a unique transient with m-curve $\Gamma$ such that acceptable
criteria of automatic control theory for optimum transients are satisfied
simultaneously.


OPTIMUM TRANSIENTS FOR SIMPLE SYSTEMS


We shall prove later that for an optimum transient in the case
of the System [1] and [3] the servo must travel at all times at full
speed until equilibrium is attained. Hence for such a transient

$$|c'| = K_2$$

It will be no restriction on the generality of the method to assume
that

$$m_0 \ge 0$$

whence at the start of the transient one is above or on the t-axis.
Let us assume that the m-curve is concave up before equilibrium
so that we come into equilibrium from above the t-axis as shown
in Fig. 1. The curve $\Gamma$ leading to equilibrium is thus concave up
before equilibrium. Then

$$c' = K_2$$

and by Equation [1] the transient satisfies the equation

$$m'' = K_1 K_2 \qquad [9]$$

Fig. 1  Arc of Curve $\Gamma$

The second derivative m'', i.e.

$$m'' = \frac{d^2 m}{dt^2}$$

is the rate of change of acceleration m' with respect to time t.
To solve Equation [9] it will be convenient to make some transformations
in the variables t, c, and m. Otherwise the formulas
will be excessively complicated. We can write Relations [1]
and [3] as

$$\frac{m'}{K_1 K_2} = \frac{c}{K_2}, \qquad \left|\frac{c'}{K_2}\right| \le 1 \qquad [10]$$

Introducing new variables M and C so that

$$M = \frac{m}{K_1 K_2}, \qquad C = \frac{c}{K_2} \qquad [11]$$

the Relations [10] become

$$M' = C, \qquad |C'| \le 1 \qquad [12]$$

which are easier to manipulate. We shall suppose that the curve
$\Gamma$ is concave down before the instant $t = t_1$ and that equilibrium
is reached at the instant $t = t_2$. We introduce a new time scale
T where

$$T = t - t_1$$

so that at the start of the arc of Fig. 1 we have T = 0. We note
this does not change Relations [12] since

$$\frac{dM}{dt} = \frac{dM}{dT}, \qquad \frac{dC}{dt} = \frac{dC}{dT} \qquad [13]$$

We shall therefore understand that the primes in Relations [12]
denote derivatives with respect to T. Equation [9] now becomes

$$M'' = 1 \qquad [14]$$

Let $M_1$ and $M_1'$ denote the values of M and M' at T = 0. By
the theory of differential equations (11) the solution of Equation
[14] is

$$M' = M_1' + T \qquad [15]$$

$$M = M_1 + M_1' T + \tfrac{1}{2} T^2 \qquad [16]$$


When M' reaches zero we want M to reach zero simultaneously;
i.e., the minimum point of the arc in Fig. 1 should be on the t-axis.
The three possibilities of solutions that emanate from

$$T = 0, \qquad M = M_1 \qquad [17]$$

at the point $P_1$ are shown in Fig. 2.

Fig. 2  Arcs for Different Initial Slopes

If we leave the point $P_1$ with too steep a slope, we obtain the curve
$\Gamma_1$ (corresponding to the Equations [15] and [16]) which overshoots
the T-axis giving an overswing accordingly. If the slope
at $P_1$ is too little the corresponding curve $\Gamma_2$ undershoots the
T-axis as shown. The slope $M_1'$ must be chosen so that M = 0
at the same instant that M' = 0. By Equation [15] we have
M' = 0 when

$$T = -M_1' \qquad [18]$$

Substituting T from Equation [18] in Equation [16] and imposing
the condition that M = 0 when T satisfies Equation [18] we have

$$M_1 - \tfrac{1}{2} (M_1')^2 = 0 \qquad [19]$$

It follows that if $M_1$ and $M_1'$ are not both zero and are related at
any instant by the Equation [19] and if thereafter we keep

$$C' = 1 \qquad [20]$$

we shall reach M = 0 at the same instant that M' = 0. If after
this instant we keep

$$C' = 0$$

that is, we hold the servo still, the variable M will satisfy the
relation

$$M'' = 0$$

whence

$$M' = 0, \qquad M = 0$$

so that we remain at equilibrium until the system is disturbed.
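
The requirement that M and M' vanish together can be checked numerically. The sketch below works in the normalized variables M and T and evaluates the solution of M'' = 1 at the instant T = −M₁' at which M' reaches zero. The function names are illustrative and are not taken from the paper.

```python
def arc_state(m1: float, m1_dot: float, t: float) -> tuple[float, float]:
    """State (M, M') at time T = t on an arc with C' = +1, so that M'' = 1."""
    m_dot = m1_dot + t                      # M' = M1' + T
    m = m1 + m1_dot * t + 0.5 * t * t       # M = M1 + M1'*T + T**2 / 2
    return m, m_dot


def value_at_minimum(m1: float, m1_dot: float) -> float:
    """Value of M at the instant T = -M1' when M' = 0 (meaningful for M1' <= 0)."""
    m, _ = arc_state(m1, m1_dot, -m1_dot)
    return m


slope = -2.0
on_condition = 0.5 * slope ** 2             # M1 = (1/2) * M1'**2
print(value_at_minimum(on_condition, slope))        # minimum lies on the axis
print(value_at_minimum(on_condition - 0.5, slope))  # negative: the curve crosses the axis
print(value_at_minimum(on_condition + 0.5, slope))  # positive: the curve stops short of the axis
```

Only the starting height that satisfies M₁ = ½(M₁')² puts the minimum of the arc exactly on the axis; the other two calls correspond to the two curves that miss equilibrium.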

If we come into equilibrium from below the T-axis and the
servo is traveling in one direction at full speed from T = 0 to
equilibrium, where equilibrium is reached for some positive
value of T, we have

$$C' = -1 \qquad [23]$$

for this arc, see Fig. 3. Replacing $M_1$ by $-M_1$ we obtain

$$M_1 + \tfrac{1}{2} (M_1')^2 = 0 \qquad [24]$$

from the Condition [19].

Thus if M' > 0 we have Equation [24], and if M' < 0, Equation
[19] with the subscripts on $M_1$ and $M_1'$ omitted. These
equations can therefore be combined into

$$M + \tfrac{1}{2}\{M'\} = 0 \qquad [25]$$

If at any instant M and M' satisfy Equation [25] the Relation
[20] or [23] will lead to equilibrium, where Equation [20] is used
if M > 0, and Equation [23] if M < 0.

Fig. 3  Curve Concave Down


We introduce the notation $\Sigma$ where

$$\Sigma = M + \tfrac{1}{2}\{M'\} \qquad [26]$$

The derivative of the absquare {x} is 2|x|x'. Differentiating $\Sigma$
with respect to T we obtain

$$\Sigma' = M' \pm M' M'' = (1 \pm M'')\,M' \qquad [27]$$

where the ± sign is + when M' > 0 and − when M' < 0. For
the arcs of Figs. 1 and 3 the sign of M'' is opposite the sign of
M'. Thus for these arcs the ± sign in Equation [27] is opposite
the sign of M''. But

$$|M''| = 1 \qquad [28]$$

Hence for the arcs of Figs. 1 and 3 we have

$$\Sigma' = 0 \qquad [29]$$

whence $\Sigma$ is a constant. Since $\Sigma$ = 0 when equilibrium is
reached it follows that

$$\Sigma = 0 \qquad [30]$$

for these arcs, whence Relation [25] holds everywhere along these
arcs.
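
The statement that M plus one half the absquare of M' stays at zero along a final arc can be spot-checked numerically. The following sketch samples an arc that enters equilibrium from above with C' = +1. The helper names (`absquare`, `sigma`, `final_arc`) and the starting slope are illustrative assumptions, not the paper's notation.

```python
def absquare(x: float) -> float:
    """Signed square |x| * x."""
    return abs(x) * x


def sigma(m: float, m_dot: float) -> float:
    """Switching quantity M + (1/2){M'}."""
    return m + 0.5 * absquare(m_dot)


def final_arc(m1_dot: float, t: float) -> tuple[float, float]:
    """State (M, M') on the last arc (C' = +1) that starts with slope M1' < 0
    from the height M1 = (1/2) * M1'**2 and reaches equilibrium at T = -M1'."""
    m1 = 0.5 * m1_dot ** 2
    return m1 + m1_dot * t + 0.5 * t * t, m1_dot + t


# Sample the arc from T = 0 to T = 2 for an assumed starting slope of -2.
samples = [sigma(*final_arc(-2.0, 0.25 * k)) for k in range(9)]
print(max(abs(s) for s in samples))   # stays at zero up to rounding error
```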

Suppose that we have a transient curve $\Gamma$ as in Fig. 1, and that
prior to T = 0 we have Equation [23] valid. The curve is then
concave down before T = 0. Keeping Equation [23] valid we
trace the curve $\Gamma$ back to a point Q where M = 0 as shown in
Fig. 4. There will always be such a point Q as the following
argument shows. Since

$$M'' = -1 \qquad [31]$$

to the left of the M-axis, the slope of the curve $\Gamma$ extended to the
left increases by 1 for each unit of time T as we travel to the left.
The slope M' will thus never reach infinity, and eventually the
extended curve $\Gamma$ will cross the M-axis with a positive slope $M_0'$.
The high point on $\Gamma$ extended is denoted by A. From Q to A
we have M' > 0 whence

$$\Sigma' = (1 + M'')\,M' = 0 \qquad [32]$$

so that $\Sigma$ is a constant. Since at Q we have

$$M = 0, \qquad M' = M_0' > 0 \qquad [33]$$

it follows that

$$\Sigma = \tfrac{1}{2}(M_0')^2 > 0 \qquad [34]$$

from Q to A. From A to $P_1$ we have M' < 0 and M'' = −1 whence

$$\Sigma' = (1 - M'')\,M' = 2M' < 0 \qquad [35]$$

Thus $\Sigma$ is decreasing from the value it has along the arc QA to
the value zero, which it has along the arc $P_1R$.

Fig. 4  Ideal Transient Above T-Axis

The curve shown in Fig. 4 leads from the point Q to the equilibrium
point R with one switching of the direction of motion of the
servo. Suppose now that the system is disturbed so that at a
given instant M and M' satisfy Relations [33] where $M_0'$ is an
arbitrary positive number. The curve of Fig. 4 (normally the
switch point $P_1$ will not be on the M-axis) will automatically lead
to equilibrium if we take

$$C' = -1, \quad \Sigma > 0; \qquad C' = +1, \quad \Sigma = 0 \qquad [36]$$

Suppose now that at a given instant $T_2$ we have

$$M = M_2 > 0, \qquad M' = M_2', \qquad \Sigma = \Sigma_2 > 0 \qquad [37]$$

With M'' = −1 we can trace a curve to the left of $T = T_2$ of the
form shown in Fig. 4, until we reach M = 0. Thus the element
(M, M') = ($M_2$, $M_2'$) can be associated with a curve that starts
at M = 0 as shown in Fig. 5. The slope $M_2'$ at the point $P_2$ can
be either +, 0, or − as long as Relations [37] are satisfied. It
follows that we can go from $P_2$ to an equilibrium point R by
following a curve as shown in Fig. 4; i.e., by choosing the sign
of C' according to Relations [36]. If at $P_2$ the slope $M_2'$ is such
that $\Sigma$ = 0, instead of > 0, we reach equilibrium along the curve
for which C' = 1. We have thus taken care of all initial conditions
where

$$M_2 \ge 0, \qquad \Sigma_2 \ge 0$$

If

$$M_2 \le 0, \qquad \Sigma_2 \le 0$$

we can take a curve, as shown in Fig. 6, through the point $P_2$
that will lead to the equilibrium point R and when traced to the
left originates at a point Q where

$$M = 0, \qquad M' = M_0' < 0$$

Fig. 5  Curve Traced From Point $P_2$ to Q
Suppose now that at the point $P_2$ of Fig. 7 we have

$$M_2 > 0, \qquad \Sigma_2 < 0$$

It follows that

$$M_2' < 0$$

From $P_2$ we trace a curve $\Gamma''$ to the right for which C' = 1.
In the Derivative [27] for $\Sigma'$ we have the minus sign and M'' = 1,
whence Equation [29] holds. Since M' increases by one unit for
each unit increase in T, eventually we will reach a point S where
M' = 0. But then M < 0 since along $\Gamma''$

$$\Sigma = \Sigma_2 < 0 \qquad [43]$$

It follows that M = 0 for some point Q between $P_2$ and S. The
curve $\Gamma''$ is of the type $\Gamma_1$ of Fig. 2. We can now reach equilibrium
as in Fig. 6. The entire transient is then as shown in
Fig. 7, provided that the T-co-ordinate of the point $P_2$ is chosen
properly. For the curve of Fig. 7 C' = 1 from $P_2$ to $P_1$ and
C' = −1 from $P_1$ to R. For the arc $P_2P_1$ we have

$$\Sigma < 0 \qquad [44]$$

and for $P_1R$

$$\Sigma = 0 \qquad [45]$$

Similarly, if

$$M_2 < 0, \qquad \Sigma_2 > 0$$

we have a curve as shown in Fig. 8, provided that the T-co-ordinate
of the point $P_2$ is chosen so that the switch point $P_1$ is on the
M-axis.

Fig. 6  Ideal Transient Below T-Axis

Fig. 7  Ideal Transient With Overshoot

Fig. 8  Ideal Transient With Positive Overshoot

We have now treated all initial conditions and shown that for
all cases

$$C' = 1 \ \text{when}\ \Sigma < 0, \qquad C' = -1 \ \text{when}\ \Sigma > 0$$

whereas

$$C' = 1 \ \text{when}\ \Sigma = 0 \ \text{and}\ M > 0, \qquad C' = -1 \ \text{when}\ \Sigma = 0 \ \text{and}\ M < 0$$

and

$$C' = 0 \ \text{when}\ \Sigma = M = 0$$

The ideal transients of Figs. 4 and 6 to 8 are determined entirely
by the sign of $\Sigma$ except when $\Sigma$ = 0.
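
The switching rule can be turned into a small calculation. The following Python sketch works in the normalized variables, where M'' = C' and |C'| ≤ 1, and takes the switching quantity to be M plus one half the absquare of M'. For a given starting state it returns the sequence of full-speed phases and then confirms that they end at equilibrium. The closed-form switching time used below was worked out for this sketch from those relations and is not quoted from the paper; all function names are hypothetical.

```python
import math


def absquare(x: float) -> float:
    """Signed square |x| * x."""
    return abs(x) * x


def sigma(m: float, m_dot: float) -> float:
    """Switching quantity M + (1/2){M'}."""
    return m + 0.5 * absquare(m_dot)


def ideal_phases(m0: float, m0_dot: float, tol: float = 1e-12):
    """Return [(C', duration), ...] for the transient starting at (M, M') = (m0, m0_dot)."""
    s0 = sigma(m0, m0_dot)
    if abs(s0) <= tol:
        if abs(m0) <= tol:
            return []                                # already at equilibrium
        c_rate = 1.0 if m0 > 0 else -1.0             # already on a final arc
        return [(c_rate, abs(m0_dot))]
    s = 1.0 if s0 > 0 else -1.0
    r = math.sqrt(s * m0 + 0.5 * m0_dot ** 2)        # |M'| at the switch point
    # First phase drives sigma to zero; second phase is the final arc.
    return [(-s, s * m0_dot + r), (s, r)]


def propagate(m: float, m_dot: float, phases):
    """Integrate M'' = C' exactly through the given constant-C' phases."""
    for c_rate, duration in phases:
        m += m_dot * duration + 0.5 * c_rate * duration ** 2
        m_dot += c_rate * duration
    return m, m_dot


phases = ideal_phases(0.0, 1.0)      # instant load rejection with M0' = 1
print(phases)
print(propagate(0.0, 1.0, phases))   # ends at (0, 0) up to rounding error
```

Exact integration over each phase is used instead of small time steps, since a stepped simulation would chatter about the switching curve and obscure the result.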
LOAD REJECTIONS AND SPEED-SETTING CHANGES


Transients of Figs. 4 and 6 starting at the point Q on the T-axis
arise when instant load rejections and increases are made on
engines. To take care of load we introduce a load term −l into
Equation [1] to obtain

$$m' = K_1 c - l \qquad [49]$$

which can be written as

$$\frac{m'}{K_1 K_2} = \frac{c}{K_2} - \frac{l}{K_1 K_2} \qquad [50]$$

from which

$$M' = C - L \qquad [51]$$

when we make the substitution

$$l = K_1 K_2 L \qquad [52]$$

and the Transformations [11]. In equilibrium M' = 0 whence

$$C = L \qquad [53]$$

so that the servo takes a unique position corresponding to the
load L. Dropping the load L is equivalent to replacing Equation
[51] by

$$M' = C \qquad [54]$$

given in Relations [12]. Suppose that we are in equilibrium
before load rejection so that

$$C = L$$

Dropping the load L instantly we have

$$M' = L \qquad [55]$$

from Relation [54] at the start of the transient. Hence at the
initial point Q of the transient we have

$$M_0' = L \qquad [56]$$

Thus the curve of Fig. 4 is the response to an instant load rejection.
Transients starting at a point off the T-axis may arise
when one instant load rejection is followed by another before the
response to the first rejection has died out.

Transients starting at a point off from the T-axis with M' = 0
arise when there is an instant speed-setting change. Such a
transient is shown in Fig. 9 where the transient starts at T = 0,
$M = -M_0$.

We shall prove in the next section that the curves of Figs. 4,
and 6 to 9 are optimum curves.


Fig. 9  Response to Change in Setting of the Controlled Variable


For the original variables and instant load rejections the results 
are shown in Fig. 10. 

In a linear control the duration (properly defined) of a transient
is independent of the magnitude of the disturbance, and the maximum
deviation from equilibrium for instant load rejections is proportional
to the magnitude of the disturbance, i.e., to $m_0'$, as shown
in Fig. 11. Thus if the maximum deviation from equilibrium
for a given load change is $m_M$, and if the load change is halved,
the deviation is also halved to $\tfrac{1}{2} m_M$, but the maximum still
occurs at the same instant $t_M$.

For the nonlinear control of Fig. 10 if we halve the magnitude
of the load rejection, the deviation $m_M$ from equilibrium goes to
$\tfrac{1}{4} m_M$, i.e., as the square of the magnitude of the disturbance, and
the duration of the transient goes down by half, i.e., in proportion
to the magnitude of the disturbance, as shown in Fig. 12.

Fig. 10  Ideal Transient

Fig. 12  Transients for Nonlinear Control
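
These scaling laws can be illustrated with a short calculation. For the simple lag-free system m' = K₁c with |c'| ≤ K₂, the two-phase optimum response to an instant load rejection with initial acceleration m₀' has a peak deviation of m₀'²/(2K₁K₂) and a duration of (1 + √2)m₀'/(K₁K₂). These expressions were derived for this sketch from the switching rule in the normalized variables and should be read as an idealization. The function name is hypothetical, and the constants K₁ = 600 and K₂ = 10 are the diesel example values quoted earlier in the paper.

```python
import math


def load_rejection_transient(m0_dot: float, k1: float, k2: float):
    """Peak deviation and duration of the ideal response to an instant load
    rejection, for the lag-free system m' = k1 * c with |c'| <= k2."""
    scale = k1 * k2
    peak = m0_dot ** 2 / (2.0 * scale)
    duration = (1.0 + math.sqrt(2.0)) * m0_dot / scale
    return peak, duration


# Full and half load rejection: initial accelerations of 600 and 300 rpm/sec.
full = load_rejection_transient(600.0, 600.0, 10.0)
half = load_rejection_transient(300.0, 600.0, 10.0)
print(full)
print(half)
```

Under these assumptions the full-load rejection gives a peak of 30 rpm and a duration of about 0.24 sec, and halving the disturbance quarters the peak and halves the duration.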

For an instantaneous increase in the setting of the controlled 
variable the optimum curve is shown in Fig. 13. 


WHY THE NONLINEAR TRANSIENTS ARE OPTIMUM

To prove that the “nonlinear” transients obtained in this
paper are optimum, let us consider any state of the system, i.e.,
at any time which we may take to be T = 0, we have

$$M = M_0, \qquad M' = M_0' \qquad [57]$$

given. This means that the vector in Fig. 14 is given. In this
figure we have drawn the “optimum” nonlinear transient $\Gamma$
as derived in this paper. There is a unique such curve $\Gamma$ leading
to equilibrium at the point P. In Fig. 14 the curve is concave
down first (C' = −1) up to the point $P_1$ with $T = T_1$, and then
concave up (C' = +1). Consider a trajectory $\Gamma_1$ that leaves
the point (0, $M_0$) with the slope $M_0'$ but reaches equilibrium before
the point P. The curve $\Gamma_1$ must then cross or leave the curve
$\Gamma$ at some point $P_2$ before the time $T = T_2$ associated with P.
This point $P_2$ is to the right of the point $P_1$ where the concavity
of $\Gamma$ changes. This must be so because the slope cannot change
faster along a trajectory than along $\Gamma$ between the initial point
$P_0$ and the point $P_1$. If we start at the point $P_2$ and increase the
slope at the maximum rate we cannot reach M' = 0 before
$T = T_2$. The curve $\Gamma_1$ will be below $\Gamma$ at $T = T_2$ as shown in Fig.
15.


Fig. 11  Transients for Linear Control

Fig. 13  Optimum Transient for Instant Change in Setting of Controlled Variable

Fig. 14  Trajectories $\Gamma$ and $\Gamma_1$ Through Given Element ($M_0$, $M_0'$)

Suppose that the curve $\Gamma$ undershoots as shown in Fig. 16.
Any trajectory leaving $P_0$ will be below or pass through the
point $P_1$ at the instant $T = T_1$, where the concavity changes,
since the slope along $\Gamma$ is increasing at the maximum rate up
to $P_1$. For a curve $\Gamma_1$ starting at the point $P_0$ with the slope
$M_0'$ to reach equilibrium before the instant $T = T_2$ the curve
$\Gamma_1$ must cross or leave $\Gamma$ between $P_1$ and the point P where
$T = T_2$. This cannot be so in view of the argument relative to
the case of Fig. 14.

The best transient in every case is thus one where equilibrium
is reached in one phase with C' = 1 or C' = −1, or in two phases
with a phase C' = 1 followed by a phase for which C' = −1,
or a phase with C' = −1 followed by a phase with C' = 1.
Here, as in the introduction, phase refers to a portion of the
transient for which C' is a constant (see Figs. 4, 6 to 10, and 13).

Fig. 15  Arc With Undershoot

Fig. 16  Curve $\Gamma$ With Undershoot


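CONTROL FUNCTIONS


The problem is now to construct a control which senses M only,
and quantities such as M' which depend on M, and yields the
optimum transients. To do this we consider $\Sigma$ where

$$\Sigma = M + \tfrac{1}{2}\{M'\} \qquad [58]$$

for the absquare {M'} of M'.

Along the arc PA of Fig. 8 (from the initial point to the maximum)
we have

$$\Sigma' = 0 \qquad [59]$$

whence

$$\Sigma = M + \tfrac{1}{2}(M')^2 > 0 \qquad [60]$$

Along the arc $AP_1$ from the maximum point A to the point of
inflection $P_1$ we have

$$\Sigma' = 2M' < 0 \qquad [61]$$

whence $\Sigma$ decreases along this arc, and from the point $P_1$ to the
equilibrium point R we have

$$\Sigma = 0$$

since at R we have

$$M = M' = 0$$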

We now use the control

$$C' = -K\Sigma \qquad [65]$$

where K is “infinitely” large, i.e.

$$C' = -1 \ \text{for}\ \Sigma > 0, \qquad C' = 0 \ \text{for}\ \Sigma = 0, \qquad C' = +1 \ \text{for}\ \Sigma < 0 \qquad [66]$$

There is an apparent contradiction between the Schedule [66]
and C' = +1 along the arc $P_1R$ of Fig. 8. However, in practice
this is not the case. Consider the inflection point $P_1$ where we
set

$$C' = 0 \qquad [67]$$

in view of Schedule [66]. When Equation [67] holds we have

$$M'' = 0$$

whence M' is a constant. Since M is decreasing $\Sigma$ will “immediately”
become negative. Then by Schedule [66] we have
C' = +1 and we follow the arc $P_1R$ to equilibrium.

In any physical situation we would have at least

$$\Sigma = -\epsilon \qquad [68]$$

for a small positive number $\epsilon$ before we would switch from
C' = −1 to C' = +1. Since $\Sigma$ is a constant along an arc for
which C' = +1 and M' < 0, when we reach M' = 0 we have

$$\Sigma = M = -\epsilon \qquad [69]$$

Thus the transient will undershoot as shown in Fig. 17 where we
have taken the initial point at T = 0. The amount of undershoot
will thus depend on the deadband in the device that controls
the servo speed.

Unless a linear zone or equivalent is provided to stabilize the
system an on-off servo will hunt forever after a sizable disturbance
as can be readily verified by mathematical theory. In
practical devices built by the author and his associates for the
simple system under discussion, the schedule given by the Formulas
[66] is used except that when M is small in absolute value
the M-term in $\Sigma$ is replaced for a constant a by aM and when
|M'| is small enough the absquare term is replaced by bM' for
a constant b so that the function C' given by

$$C' = -K(aM + bM') \qquad [70]$$

yields good characteristic roots when combined with

$$M' = C \qquad [71]$$

For extremely good electronic components, and a system with
no lag for practical purposes, one can use a value of K as high
as 200, and still have stable transients for disturbances where
M = 0, M' = 1. However, 200 is near the upper limit. See
Fig. 18 for the case of optimum transients and instant load rejections
obtained in the laboratory. Fig. 18 and Figs. 21 to 27 are
photographs of Sanborn oscillograph traces. The RPM-curve
of Fig. 18(a) shows the rpm response to instant 25 per cent load
rejections and load increases, whereas the RPM-curve of Fig.
18(b) shows the corresponding results for the 50 per cent load
case. Note that the RPM-overswing for 25 per cent load rejection
is one fourth as much as for 50 per cent load rejection. Note
also that there are no RPM-underswings for the load rejections,
and that the transients for the 25 per cent load case are over in
one half of the time required for the 50 per cent case. The servo
trace of Fig. 18(a) is to a different vertical scale from this trace
for Fig. 18(b). For Fig. 18(a) the servo goes from the 50 per
cent load level to the 25 per cent load level, then back to the
50 per cent level, etc. The levels for Fig. 18(b) are for 75 and
25 per cent load.

Fig. 17  Practical Transient

Fig. 18  [oscillograph traces; panel labels: 50% LOAD, SERVO]

The optimum physically realizable characteristic roots can be determined from theory and the limitations of the equipment. The change from M to aM in Σ can normally be made at M between 0.001 and 0.1, and similarly the change from ½{M'} to bM' can be made when M' is somewhere in the range from M' = 0.01 to M' = 0.1. The place where a change in a coefficient is made depends on the magnitude of the disturbances. For the case of Fig. 18 the lags in the system and control were very small so that without a linear control band with the Function [70] the hunt in M was 0.0025 per cent peak to peak. The range of load rejections was 0.625 to 50 per cent, i.e., a range of 80:1, so that the range in overspeeds was 6400:1, namely, from 0.0078 to 50 per cent.
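The 6400:1 figure can be checked with a few lines of arithmetic. The sketch below assumes that the overspeed varies as the square of the load rejection, which is what the no-lag relation between the initial M' and the peak of M suggests; the variable names are illustrative only.

```python
# Smallest and largest instant load rejections quoted in the text (per cent).
min_rejection = 0.625
max_rejection = 50.0
# Overspeed quoted for the largest rejection (per cent).
max_overspeed = 50.0

# Assumption: overspeed scales with the square of the load rejection.
rejection_ratio = max_rejection / min_rejection   # range of rejections
overspeed_ratio = rejection_ratio ** 2            # range of overspeeds
min_overspeed = max_overspeed / overspeed_ratio   # overspeed at 0.625 per cent

print(rejection_ratio, overspeed_ratio, min_overspeed)
# prints: 80.0 6400.0 0.0078125
```

The square law is also consistent with the observation from Fig. 18 that halving the load rejection cuts the RPM-overswing to one fourth.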

Consider, for the moment, the case where ε is infinitely small (to be rigorous one should use a limit process here). Suppose that the system at any instant is in the state M = M₀, M' = M₀', say, at T = 0. If Σ > 0, then by the Schedule [66] we have C' = −1. Along the trajectory thus obtained Σ will be constant if M' ≥ 0. Eventually, M' < 0, whence Σ will decrease to Σ = 0. By the schedule we then have C' = 0, and Σ becomes negative immediately, whence by the Schedule [66] we soon have C' = +1 and we are brought to equilibrium. At the instant when equilibrium is reached we set C' = 0, whence Σ = 0 thereafter, at least until the next disturbance; similarly, if Σ < 0 at T = 0.

If Σ = 0 at T = 0 and we are not at equilibrium, then

$$M_0' \neq 0 \tag{72}$$

and with C' = 0 immediately Σ becomes + or −, and Schedule [66] will lead to equilibrium along an optimum transient.

Since all transients are optimum when the Schedule [66] is used, we obtain "optimum" transients for instant load rejections as well as for instant changes in the setting of the controlled variable by using this schedule.
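The switching logic of Schedule [66] can be illustrated numerically. The following is a minimal sketch, assuming the per-unit no-lag system M' = C with servo speed limited to |C'| ≤ 1 and the control function Σ = M + ½{M'}. The function names, the Euler integration, and the step size are illustrative assumptions and are not taken from the paper.

```python
def absquare(x):
    """Absquare {x} = |x| * x."""
    return abs(x) * x


def sigma(M, Mp):
    """Control function for the no-lag case: Sigma = M + (1/2){M'}."""
    return M + 0.5 * absquare(Mp)


def schedule(s):
    """On-off schedule: C' = -1 for Sigma > 0, 0 for Sigma = 0, +1 for Sigma < 0."""
    if s > 0:
        return -1.0
    if s < 0:
        return 1.0
    return 0.0


def simulate(M0, Mp0, t_end=3.0, dt=1e-3):
    """Integrate M' = C, C' = schedule(Sigma), by Euler steps (assumed scheme).

    Since M' = C, the servo speed C' equals M''. Returns the final M,
    the final M', and the largest M reached.
    """
    M, Mp = M0, Mp0
    peak = M
    for _ in range(int(round(t_end / dt))):
        Mp += schedule(sigma(M, Mp)) * dt
        M += Mp * dt
        peak = max(peak, M)
    return M, Mp, peak


M_end, Mp_end, M_peak = simulate(0.0, 1.0)
print(M_peak, M_end, Mp_end)
```

For the disturbance M = 0, M' = 1 the computed overswing comes out close to ½ per unit, and the residual error near equilibrium is of the order of the integration step, which plays a role similar to the deadband ε discussed above.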


EFFECT OF DAMPING IN SYSTEM


The same type of mathematical treatment as used in the 
simplest case holds for other cases. Because this theory is ex- 
tremely complicated and beyond the limits of this paper the 
proofs will be omitted although the results will be summarized.* 

* The proofs are on hand but have not been written up for publication.

Fig. 18  Photograph of Optimum Nonlinear Transients in an Electronic Case [panels (a) and (b); traces of RPM, servo, and load against time]

Consider now the system with the equation

$$M' + \alpha M = C \tag{73}$$

with α ≥ 0 in place of the equation in Formulas [12]. The term αM in this equation is a damping term. We introduce the controlling function Σ where


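$$\Sigma = M + \frac{M'}{\alpha} \pm \frac{1}{\alpha^2}\ln\left(1 + \alpha|M'|\right) \tag{74}$$

Here the ± sign is

$$\begin{cases} \text{plus when } M' < 0 \\ \text{minus when } M' > 0 \\ \text{either when } M' = 0 \end{cases} \tag{75}$$

Expanding the ln-term of Equation [74] in powers of α|M'| we obtain

$$\Sigma = \Sigma_0 \pm \left(\frac{\alpha|M'|^3}{3} - \frac{\alpha^2|M'|^4}{4} + \cdots\right) \tag{76}$$

where

$$\Sigma_0 = M + \tfrac{1}{2}\{M'\} \tag{77}$$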

is the control function for the case with the damping term neglected. If |M'| is small the function Σ can thus be replaced by Σ₀. In any case it is advisable to use the function Σ₀ of Formula [77] in place of the Σ of Formula [74], as if the damping term were missing. Theory shows that the presence of the damping improves the transients and that the difference between neglecting and not neglecting the damping term is often not too great. Similarly, if the damping term is f(M') for a monotonic nondecreasing function of M' where

$$f(0) = 0$$

it is advisable to neglect the damping, since theory shows its presence to be beneficial.

Mathematical considerations show that the optimum transient for a load rejection is that given in Fig. 19. Along the arc AP₁ we have


M2 — aM’) 
1— aM’ 
whence Σ is decreasing. At the point P₁ (not in general an inflection point) Σ becomes zero. Schedule [66] still applies, however, with the Σ of Formula [74].


Fig. 19  Optimum Transient for System With Damping
The maximum value M_M of M is given by

$$M_M = \frac{M_0'}{\alpha} - \frac{1}{\alpha^2}\ln\left(1 + \alpha M_0'\right) \tag{79}$$

From theory we can show that the transient is optimum in every sense and unique. The theory holds for transients that start at a point where M₀ ≠ 0 as well as those where M₀ = 0.
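The effect of the damping term on the control function can be explored numerically. The sketch below assumes that the damped control function has the form Σ = M + M'/α ± (1/α²) ln(1 + α|M'|), with the plus sign for M' < 0, and that the peak after a disturbance starting from M = 0 is M₀'/α − (1/α²) ln(1 + αM₀'). Both forms are readings of the damaged formulas in this section and follow from the per-unit equation M'' + αM' = C' with |C'| ≤ 1; the function names are illustrative.

```python
import math


def absquare(x):
    """Absquare {x} = |x| * x."""
    return abs(x) * x


def sigma_no_damping(M, Mp):
    """Control function with the damping term neglected."""
    return M + 0.5 * absquare(Mp)


def sigma_damped(M, Mp, alpha):
    """Assumed damped control function; plus sign when M' < 0, minus when M' > 0."""
    sign = 1.0 if Mp < 0 else -1.0
    return M + Mp / alpha + sign * math.log1p(alpha * abs(Mp)) / alpha ** 2


def peak_after_rejection(Mp0, alpha):
    """Assumed peak of M for a disturbance starting at M = 0 with M' = Mp0 > 0."""
    return Mp0 / alpha - math.log1p(alpha * Mp0) / alpha ** 2


# For small alpha the damped function approaches the undamped one.
print(sigma_damped(0.2, -0.5, 1e-3), sigma_no_damping(0.2, -0.5))
# Damping lowers the peak below the undamped value of 1/2 for M0' = 1.
print(peak_after_rejection(1.0, 0.5))
```

Under these assumptions the peak for M₀' = 1 and α = 0.5 is about 0.378, compared with 0.5 without damping, which agrees with the statement that the presence of damping improves the transients.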


SINGLE-LAG SYSTEMS


We now treat a physical system with a lag between servo and torque (in the prime-mover case). This lag is assumed in this section to be one associated with a pure time constant. The equation of the controlled system is now

$$\tau M'' + M' = C \tag{80}$$

instead of the first equation in Formulas [12]. Here τ is the time constant.

In the simplest prime-mover case the ideal transients are com- 
posed of one or two phases. This is no longer true for systems 


with lags. We introduce the function 2 given by 


(agn nt{1 + V1 — (1 + [sen 4177} 1, 


where 


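$$\psi = M + \tau M' \tag{82}$$

and

$$\operatorname{sgn}\psi' = \begin{cases} +1 & \text{if } \psi' > 0 \\ \phantom{+}0 & \text{if } \psi' = 0 \\ -1 & \text{if } \psi' < 0 \end{cases} \tag{83}$$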


The controlling function Σ for ideal transients involves two functions Σ₁ and Σ₂ where

$$\Sigma_2 = \psi + \tfrac{1}{2}\{\psi'\} \tag{84}$$

and Σ₁ is the entire expression given in Formula [81]. When Σ₁ becomes imaginary, we take the control function Σ equal to Σ₂. For an instant load rejection the two control functions are needed to bring one to equilibrium along the optimum curve. Here optimum is used in the same sense as before. The ideal transient for instant load rejection is shown in Fig. 20 for the one-lag case.



Along the first phase OQ₁ of Fig. 20 we are controlling on the basis of Schedule [66] where

$$\Sigma = \Sigma_1 \tag{85}$$

Along this arc Σ₁ > 0 and C' = −1. At the point Q₁ the function Σ₁ becomes zero. At a point E after Q₁ we have ψ' = 0. Along the arc Q₁E

$$\Sigma_1 = 0 \tag{86}$$

and C' = +1. Schedule [66] gives C' = 0 for this arc. However, after the point Q₁ the function Σ₁ will then become negative immediately whence by Schedule [66] we have C' = +1. At the point E the function Σ₁ becomes imaginary. From the point E to Q₂ the function Σ₂ is negative and increasing. At the point Q₂ the function Σ₂ becomes zero. From E on we control on the basis of the function Σ₂. Schedule [66] with Σ = Σ₂ gives C' = 0 for the arc Q₂P. However, again with C' = 0 the function Σ will become positive immediately, whence by Schedule [66] we have C' = −1. We will now be brought to equilibrium (at least for practical purposes).

Fig. 20  Transient for One-Lag Case

At the start we have ψ' > 0. This derivative decreases to zero at a point A. From the point A to the point E between Q₁ and Q₂ the quantity ψ' < 0. Between the points Q₁ and E there is the inflection point B, for which M'' = 0 occurs.

From the point E to the equilibrium point P the quantity ψ' is positive, increasing from the point E to the point Q₂ and decreasing from there to zero at the point P.

Equilibrium can be reached after any disturbance on the basis of Schedule [66] where control is based on the sign of Σ₁ or Σ₂ and during a transient the control function alternates (in the proper manner) between Σ₁ and Σ₂. Equilibrium can be reached on the basis of Schedule [66] in four or fewer phases. For "most" disturbances three phases suffice. There are initial conditions for which the optimum transient is not one where the servo travels at full speed. These would, however, be considered exceptional. Such cases arise when the M-curve is plunging toward the T-axis with a steep slope and reaching equilibrium with maximum servo speed would require the phases C' = +1, −1, +1, −1 in this order, or the phases C' = −1, +1, −1, +1, putting a hump in the M-curve that could otherwise be avoided.*


* The complete mathematical theory underlying the foregoing state- 
ments for the single-lag case of Formula [80] is quite complicated. 
It is in the files of the author, but has not been written up for publica- 
tion. 




HIGHER-LAG CASES AND PRACTICAL COMPROMISES


It can be proved by mathematical considerations that for disturbances of a magnitude normally encountered the log term in Formula [81] is small compared to the rest of Σ₁ and can therefore be dropped. The controlling function is thus

$$\Sigma_2 = \psi + \tfrac{1}{2}\{\psi'\} \tag{87}$$

For a controlled system given by

$$\tau m'' + m' = K_1 c \tag{88}$$

and servo speed limitation

$$|c'| \le K_2 \tag{3}$$

the formula for nonlinear control on the basis of Σ₂ becomes

$$c' = -K\left[(m + \tau m') + \frac{1}{2K_1K_2}\{m' + \tau m''\}\right] \tag{89}$$

where K is as large as possible.
For a two-lag case we have the equation

$$\tau_1\tau_2 M''' + (\tau_1 + \tau_2)M'' + M' = C \tag{90}$$

with time constants τ₁ and τ₂. Equation [90] can be written as

$$(\tau_1 D + 1)(\tau_2 D + 1)M' = C \tag{91}$$

where D stands for the derivative with respect to time T. The three controlling functions for this case are quite complicated and will be omitted, except for the analog of Σ₂ in Formula [84]. We let ψ be given by

$$\psi = M + (\tau_1 + \tau_2)M' + \tau_1\tau_2 M'' \tag{92}$$

For the last phase of a transient corresponding to a load rejection the quantity Σ₂ = 0, where Σ₂ is as given in Equation [84], but with ψ as in Formula [92].

More generally, if the equation of the controlled system is given by

$$O(D)M' = C \tag{93}$$

for an operator

$$O(D) = (\tau_1 D + 1)(\tau_2 D + 1)\cdots(\tau_n D + 1) \tag{94}$$

in the derivative D, for each transient the function Σ₂ vanishes for the last phase, where Σ₂ is given as in Equation [84] and

$$\psi = O(D)M \tag{95}$$

Control on the basis of Σ₂ often gives a good approximation to the optimum transients. We can write Equation [93] as

$$\psi' = C \tag{96}$$

Control on the basis of Σ₂ and this ψ' is control on the basis of an auxiliary variable; namely ψ instead of M. We make ψ come to equilibrium in an optimum manner. The ψ-system [96] is a no-lag system. When equilibrium is reached with ψ = 0, the variable M comes to its equilibrium M = 0 according to the differential equation

$$O(D)M = 0$$

For the one-lag case of Equation [80] the variable M comes to equilibrium exponentially according to the law

$$M = M_1 e^{-T/\tau}$$

where M₁ is the value of M at the instant ψ reaches equilibrium.
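The exponential approach of M to equilibrium in the one-lag case can be checked by integrating the residual equation directly. The sketch below assumes that, once ψ = M + τM' has reached zero and is held there, M obeys τM' + M = 0, so that M = M₁e^(−T/τ); the function names, the Euler scheme, and the numerical values are illustrative assumptions.

```python
import math


def decay_closed_form(M1, tau, T):
    """Assumed law M = M1 * exp(-T / tau) after psi has reached equilibrium."""
    return M1 * math.exp(-T / tau)


def decay_numeric(M1, tau, T, dt=1e-4):
    """Euler integration of tau * M' + M = 0, i.e. psi = M + tau * M' = 0."""
    M = M1
    for _ in range(int(round(T / dt))):
        M += (-M / tau) * dt
    return M


# Residual M three time constants after psi settles (illustrative values).
print(decay_closed_form(0.1, 0.2, 0.6), decay_numeric(0.1, 0.2, 0.6))
```

After three time constants the residual is about 5 per cent of M₁, which indicates why a dominant lag sets the tail of the transient even when ψ itself is brought to equilibrium in an optimum manner.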


If in Formula [92] the time constant τ₂ is small compared to τ₁, the Formula [92] can be replaced by the single-lag Formula [82] with τ = τ₁ or better, with τ = τ₁ + τ₂. This is not the case if τ₁ = τ₂ (see Fig. 22).

If a dead time τ_d is introduced into Equation [93] so that this becomes


= C. .. 


the dead time may be treated as a time constant unless it is large 
or dominates the other lags in the system (see Appendix 1 for 
some results on systems with dead time). 

The foregoing theory applies if O(D) has quadratic factors that do not factor further (in the field of real numbers), corresponding to second-order lags. Actually, O(D) in Formula [95] may be any polynomial with positive coefficients.

If a damping term f(M) is included on the left of Equation [93] this term can be dropped as in a previous section on systems with damping, since it improves the transients obtained on the basis of design without it. It is assumed, as in the introduction, that f(M) is a monotonic nondecreasing function of M. Discussion of coulomb damping will be omitted here for the sake of brevity.

If now we have the equation 


O(D)M = (99) 


for the controlled system, where O₁(D) and O₂(D) are polynomials in D with the linear term missing in O₂(D), all of the terms on the right of Equation [99] may often be dropped except the C-term and an equation of Type [93] used instead, at least when applying the nonlinear approach of this paper to obtain good, though not necessarily optimum, transients.

We remark that in the special case 


M’ C 


it is impossible for M' and C to approach zero (equilibrium values) simultaneously while C' = ±1. It follows that the optimum transients in this case are not obtained by having the M-curve approach the T-axis with maximum |C'|.

Because of space considerations the treatment of the case where 
a term in the integral of M occurs on the left in Equation [99] 
will be omitted. 

Where large lags are involved theory and experiments show 
that nonlinear control can be used to reduce servo jiggle that 
arises with linear control and give faster return to equilibrium. 


LINEAR BAND 


With the control Formula [65] for Σ involving the absquare the system may be unstable. As M, M', etc., become numerically small in Σ their coefficients may be increased or decreased so that when |M| and its derivatives are small, and one is thus near equilibrium, the control formula becomes

$$C' = -K\Sigma_L \tag{100}$$

for a sum such as

$$\Sigma_L = aM + bM' + eM'' \tag{101}$$

where a, b, and e are constants chosen so as to give good stability.


PRACTICAL COMPROMISE

A promising approach for nonlinear control as discussed here is that where a system is treated as a one-lag system and the second derivative is dropped from Formula [89] in the interest of simplification. The control formula is now

$$c' = -K\left[m + \tau m' + \frac{1}{2K_1K_2}\{m'\}\right] \tag{102}$$

where we use actual variables, rather than per unit quantities, the equation of the controlled system is given by Formula [88], and the servo limitation is given by the Relation [3]. To compensate for dropping the m''-term it is necessary to increase the coefficient of the absquare term so that Formula [102] is replaced by

$$c' = -K\left[m + (\tau + \beta|m'|)m'\right] \tag{103}$$

for a constant β. The coefficient of m' is now a variable, which tends to give a more highly damped response for large values of |m'| than for small ones. In practice the value of β is adjusted so that β|m'|max dominates τ for the maximum absolute value |m'|max of the derivative m' to be encountered in practice. Theory and experiments show that it is desirable to have β|m'|max equal to about 10τ or 25τ. The maximum value |m'|max depends on the magnitude of the disturbances encountered by the controlled system. An experimental transient for a case where Formula [103] is used is shown in Fig. 21. The top and bottom curves are for engine speed and servo position with respect to time. This figure shows the response to a change in the setting of the controlled variable.

Fig. 21  Experimental Nonlinear Transient for a Change in the Setting of the Controlled Variable

If in Formula [103] we replace |m'| by the "average" value it has for the larger disturbances we obtain a linear control that is at least a rough approximation to the nonlinear.

In practice it is convenient to choose K and τ so that the characteristic roots are optimum when the absquare term is dropped from Formula [103] and then to adjust β so that it is as large as possible without causing hunting of the controlled system after the worst disturbance is made to which the system is to be subjected. The use of absquaring as in Formula [103] allows one to have very poor characteristic roots when the derivative |m'| is large, corresponding to fast movements of the servo, and ideal roots when near equilibrium, yielding optimum stability for small disturbances.

To allow for excessive disturbances it is desirable to bound the value of |m'| or the term |m'|m' in Formula [103].
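The practical control law of Formula [103], c' = −K[m + (τ + β|m'|)m'], together with the servo speed limit and the suggested bound on |m'|, can be written as a short routine. This is a sketch under stated assumptions: the function name, the argument names, and the numerical values are hypothetical, and bounding |m'| only inside the variable coefficient is one possible reading of the last paragraph.

```python
def servo_speed_command(m, m_dot, K, tau, beta, c_dot_max, m_dot_bound=None):
    """Servo speed c' = -K * [m + (tau + beta * |m'|) * m'], limited to +/- c_dot_max.

    m           deviation of the controlled variable
    m_dot       its derivative m'
    m_dot_bound optional bound on |m'| used in the variable coefficient
                (an assumed way of allowing for excessive disturbances)
    """
    magnitude = abs(m_dot)
    if m_dot_bound is not None:
        magnitude = min(magnitude, m_dot_bound)
    coefficient = tau + beta * magnitude          # variable coefficient of m'
    raw = -K * (m + coefficient * m_dot)
    return max(-c_dot_max, min(c_dot_max, raw))   # servo speed limitation


# Illustrative values only.
print(servo_speed_command(0.01, -0.2, K=10.0, tau=0.1, beta=0.5, c_dot_max=1.0))
```

Replacing `abs(m_dot)` by a fixed average value gives the linear approximation mentioned above, with the constant coefficient τ + β|m'|avg on m'.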


DISCONTINUITIES 


In the theory developed so far no allowance has been made for 
the fact that the servo stroke is limited. If the maximum servo speed is such that the servo goes through its stroke in a small
fraction of the time required for the larger transients, so that 
application of the nonlinear theory gives essentially two-position 
control, the nonlinear theory developed here breaks down. For- 
tunately, this is not the case in many, if not most, applications. 
If the servo is not at the ends of its stroke during a major part of 
the transient, the nonlinear performance will normally not deviate 
too much from theory. If the servo speed is not the same for the 
two directions of travel of the servo, but is not too different for 
the two directions (theory and experiment indicate that a 2:1 
variation is acceptable), the transients for nonlinear performance 
on the basis of this paper are still near optimum. The author's 
theory for the two-speed case will be omitted for the sake of 
brevity. 

Tests of variations of all of the constants show that the nonlinear control treated here is not critical. Variations of 2:1 to 4:1 in the coefficients of Σ and the second power used in the absquare can be tolerated.

AREA OF USEFULNESS 

If the lags in a system to be controlled are small, the range of 
disturbances is large, and the discontinuities are not severe, 
substantial improvement (by an order of magnitude) over existing 
control can be obtained for a range of disturbances with the non- 
linear control described here. Experiments indicate that defi- 
nite improvement can also often, if not generally, be obtained 
when the first two conditions are not satisfied. 

A fundamental limitation, from the theoretical point of view, 
on the use of nonlinear control is that of noise. By ‘noise’ we 
mean the unwanted part of the signal input to the controller. 
This signal contains the measurement of the controlled quantity, 
which is wanted, as well as another portion, which is not wanted. 
In the absquare {m’} of m’ the noise is worse than it is in m’, 
and, in fact, may be almost as bad as in m”. On account of noise 
the gain constant K in Formula [65] is limited, if it is not limited 
for other reasons. It follows that for the nonlinear control to be effective the quantities m' (or ψ') must be large enough for the bigger disturbances to dominate the noise. If a system is subject only to very small disturbances such as those which cause M' to be 0.01 per cent maximum, the noise may prevent the use of nonlinear control involving the absquare.

If the noise is too great the servo jiggle will be excessive. 

If a system has very large lags relative to the servo-stroke time, such as a ten-second time constant for a three-second servo, a term in M'' can be added to the ψ-term in Formula [84], and the coefficients in Σ can be changed to compensate for dropping the ψ'-term in Formula [84], so that the use of a linear Σ with second derivative is a satisfactory approximation to the performance that can be obtained by the nonlinear Σ of Formula [84], and the employment of a nonlinear Σ with the second derivative term in the absquare is not economically justifiable. However, Formula [103] still applies if the second derivative is not used.

For n sufficiently large, such as n = 10 or 100, a dead time τ_d may, for practical purposes, be replaced by n first-order lags in cascade with identical time constants equal to τ_d/n. The corresponding nonlinear theory requires the use of derivatives up to a high order, such as the 11th or more, which noise makes impossible. This difficulty is avoided in the precise dead-time theory of Appendix 1.
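How closely n equal lags reproduce a dead time can be judged from their frequency responses. The sketch below compares the cascade 1/(1 + jωτ_d/n)ⁿ with the pure dead time e^(−jωτ_d); working in the frequency domain, the function names, and the test frequency ωτ_d = 1 are assumptions made for illustration.

```python
import cmath


def cascade_response(omega, tau_d, n):
    """Frequency response of n identical first-order lags, each with time constant tau_d / n."""
    return (1 + 1j * omega * tau_d / n) ** (-n)


def dead_time_response(omega, tau_d):
    """Frequency response of a pure dead time tau_d."""
    return cmath.exp(-1j * omega * tau_d)


def approximation_error(omega, tau_d, n):
    """Magnitude of the difference between the cascade and the dead time."""
    return abs(cascade_response(omega, tau_d, n) - dead_time_response(omega, tau_d))


for n in (10, 100):
    print(n, approximation_error(1.0, 1.0, n))
```

At ωτ_d = 1 the error is a few per cent for n = 10 and falls below one per cent for n = 100, which is in line with the values of n suggested in the text.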

With a linear control function the initial overswing (or under- 
swing) for a given instantaneous disturbance can be made the 
same as would be obtained for the optimum nonlinear control 
with the same servo, and bounded servo speed. However, non- 
linear control with the absquare can be employed to improve 
the response to maximum disturbances by eliminating or reducing 
undesired oscillations (after the first swing), and to reduce overswings and underswings in the responses to lesser disturbances. For small disturbances a practical control is normally purely
linear. Thus in the case of prime movers the overswing for an 
instantaneous 100 per cent load rejection may be adjusted to 
be the same for linear and nonlinear control, whereas for instant 
load rejections of less than 100 per cent the overswings are re- 
duced. However, the linear control that gives the same over- 
swing as for a satisfactory nonlinear control may be too oscil- 
latory, so that even for 100 per cent load rejections the non- 
linear control is to be preferred. Essentially, by nonlinear con- 
trol more efficient use is made of the servo. 

Co-operation of the prime-mover manufacturer in reducing lags 
in the system to be controlled can often result in substantial im- 
provement in the control by permitting the use of a nonlinear 
governor nearer to that indicated by the theory for the no-lag 
case. Thus in one application the time constant of a servo 
(supplied by the governor user) due to trapped air was '/, sec, 
when it should have been about '/io sec. The result was a 
different order of magnitude in the speed deviation for the maxi- 
mum disturbance to be encountered. 


RESPONSE TO SINUSOIDAL AND OTHER DISTURBANCES


Consider the equation

$$M' = C - \sin \omega T$$

Optimum performance is obtained by letting

$$C = \sin \omega T$$

This condition can be achieved by linear control only for discrete values of ω, and in no case by the nonlinear control of this paper. Frequency-response runs on systems based on optimum nonlinear control and on the best linear control show about the same results except that at large amplitudes of the forcing sine wave the nonlinear control yields a lower resonant frequency where the system response has a peak.

For linear throttle bursts (constant velocity) of 250 to 1000 
rpm in '/, to 3 sec on a simulated 3000-rpm airplane engine, 
connected to a simulated propeller the overspeeds for nonlinear 
control based on Formula [103] were about half of the overspeeds 
obtainable by linear control. In this example the servo lag was 
taken to be 0.02 sec (time constant). 

Space does not permit the inclusion of the author’s theory for 
throttle bursts and other kinds of disturbances. 


Fig. 22(a)  Experimental Response for a Single-Lag System With a Nonlinear Control

Fig. 22(b)  Response of Engine With Two Equal Lags


PHYSICAL NONLINEAR EQUIPMENT


Methods of producing the mathematical operations needed to 
yield the nonlinear control functions of this paper are well known. 
Thus the use of d-c circuits for differentiating electrically and 
dashpots for doing this mechanically is classical. 

The use of absquaring physical components is also classical. 
Nonlinear resistors exist for which the current is proportional to 
the absquare of the voltage. The pressure drop across a sharp- 
edged orifice is proportional to the absquare of the flow. Thus 
the absquare is easy to produce by physical devices in common 
use. However, the improvement of the transients in a problem 
must be weighed against the cost of including nonlinear terms in 
the control function. 


EXPERIMENTAL RESULTS 


All of the points of the theory in this paper have been checked 
experimentally in the laboratory of the author’s company by 
electronic and other means on simulated and actual engines. 

Some experimental results are shown in Figs. 22 and 23. Fig. 
22(a) is for a one-lag ('/s-sec time constant ) system where Formula 
[89] is used for control. This figure shows the response to instant 
load rejections and increases. The curves in Fig. 22(b) are the
same as for Fig. 22(a) except that the lag with ‘/;-sec time con- 
stant has been split into two identical lags with '/\-sec time 
constant. Fig. 22(c) is the same thing with the '/;-sec lag split 
into three '/,s-sec lags. The asymmetry of the rpm swings in 
Fig. 22(c) is due to minor errors in the equipment and is to be 
disregarded. The vertical scales for Figs. 22(b) and 22(c) differ 
a little from those for Fig. 22(a). For Figs. 22(b) and 22(c) the 
time constants are lumped into one in the control Formula [89]. 
To eliminate the oscillations near equilibrium in Fig. 22(c) one 
must employ a sizable linear band about equilibrium or a control 
formula with higher derivatives, or some other technique such as 
a deadband. In Fig. 23 is shown the rpm response of an engine 
to a load increase and a load rejection for the case where the 
engine and filter in the system have four lags, with time con- 
stants 0.4 sec, 0.1 sec, 0.1 sec, and 0.05 sec, respectively, and a 
control formula of Type [89] with second derivative is used. 
Here the 0.4-sec lag dominates the others. 


Fig. 22(c)  Response of Engine With Three Lags

Fig. 23  Response of System Where One Lag Dominates Three Others

Fig. 24(a)  Linear Response to Load Rejection


In Figs. 24-27 are shown the experimental results of using an 
absquare term (Formula [103]) for prime-mover systems where 
the disturbances are so small that the servo speed never reaches 
a maximum value. Fig. 24(a) gives the response of such a system 
to load disturbances where the governor is a linear one adjusted 
to give the best results that can be obtained with a linear control 
function where servo speed c’ is a linear combination of speed 
deviation m and acceleration m’. In this figure load is taken 
on and then rejected. In Fig. 24(b) is shown the response of the
same system to the same disturbances when a practical amount 
of absquaring of the acceleration m’ is introduced; i.e., Formula 
[103] is employed. Note the improvement in speed deviation by 
an order of magnitude. Note also that the part of the transient 
after the first swing is affected very little because the absquare 
plays a small role when one is near equilibrium. In Fig. 24(b) 
the case of load rejection is shown first, whereas in Fig. 24(a) it 
is shown last. 


Fig. 24(b)  Nonlinear Response to Same Disturbance

Fig. 25(a)  Linear Response to Disturbance

Fig. 25(b)  Nonlinear Response to Same Disturbance With Same Speed Deviation


For the curves of Fig. 25(a) the gain of the linear governor used 
for Fig. 24(a) has been raised as high as possible without having 
the system break into a sustained hunt. By increasing the gain 
so as to obtain the same linear performance (i.e., characteristic 
roots) as in Fig. 24(a) and introducing a term in the absquare of 
the acceleration in Formula [103] the same overswing for load 
rejection is obtained, as shown in Fig. 25(b), but the oscillations 
have been removed for practical purposes. In Fig. 26 the con- 
stants in a linear governor [the governor of Fig. 24(a)] for the 
same prime-mover system have been adjusted to give the same 
rpm overswing for load rejection as in Fig. 25(a) and rpm tran- 
sients as near those of Fig. 25(b) as possible. Note how slowly 
the rpm curve of Fig. 26 drags in to equilibrium. Note also 
the oscillations, evident from the servo speed trace. 

The response of an engine to quarter-load rejections under linear and nonlinear hydraulic governor control is shown in Fig. 27. The linear governor represented by Equation [70] was adjusted to the border of instability so that a slight numerical increase in the coefficients of the control formula resulted in hunting. This was done to make the overswing a minimum. The nonlinear governor, based on Formula [103], was adjusted so that a good transient was obtained for Fig. 27 without encountering instability. Note that the area under the spike for the nonlinear case is about half of what it is for the linear. For this test the linear and nonlinear governors were actually the same governor except for the physical components generating the absquare in the nonlinear case and the derivative term m' in the linear.

Fig. 26  Linear Response Initially to Nonlinear Response

Thus even though the servo speed does not attain its maximum value for the disturbances under consideration, the introduction of the absquare can be used to substantially improve the response to the "larger" of the disturbances to be encountered.


SUMMARY 


In unpublished work the author has treated such topics as the 
effect of deadband, changing gain in reaching a linear control 
band about equilibrium, the use of linear and nonlinear approxi- 
mations to nonlinear control functions, the effect of a second 
sudden disturbance before the first has died out, the employment 
of different types of linear control zones near equilibrium, the 
use of an arbitrary g(c), the determination of control functions 
for higher-order systems, and other topics which need to be cov- 
ered in a complete treatment of the subject, but which will be 
omitted here. 

Instant load rejections and the corresponding transients are of considerable concern to the user of speed governors. Let O(D) be a polynomial in D with positive coefficients. To obtain optimum transients in the case

$$O(D)m' = K_1 c - l \tag{104}$$

$$|c'| \le K_2 \tag{3}$$

it is necessary to use control functions which include

$$\Sigma = \psi + \frac{1}{2K_1K_2}\{\psi'\} \tag{105}$$

where {ψ'} is the absquare |ψ'|ψ' of ψ' and

$$\psi = O(D)m \tag{106}$$

Since Σ in Formula [105] involves the third or higher derivatives of m if O(D) is of the second order or higher, due to noise precise optimum transients cannot be realized physically for higher-lag systems.

Fig. 27  Partial-Load Rejections on Engine [panels: linear, nonlinear]
Good transients can often be obtained by using

$$c' = -K\Sigma \tag{107}$$

as the only nonlinear control formula, where K is as large as possible, and a linear band, dead zone, or something else is employed to achieve stability where necessary. If the equation is

$$m' + f(m) = \frac{1}{O(D)}K_1 c \tag{108}$$

with a damping term f(m) that does not decrease as m increases, this term may be dropped and good results obtained by applying the theory for Equation [104]. Let ψ denote m + τm'. In the case where O(D) is the expression (τD + 1) only one control function

—(sgn + (: + [sgn Vv’) ) Ka 


. [109] 


is needed to give optimum transients where {ψ'} is the absquare of ψ'. When the log term in Formula [109] is imaginary it is dropped.

Formula [105] may often be replaced by

$$\Sigma = m + (\tau + \beta|m'|)m' \tag{110}$$

especially where the servo will not attain its maximum speed. Formula [110] may frequently in turn be replaced by a linear expression by using an average value of |m'|.

When higher derivative terms in c are introduced on the right of Equation [104] the function Σ of Formula [105] can still be used. If a term in c' is introduced on the right in Equation [104] the nonlinear theory of this paper breaks down. If dead time τ_d is introduced into Equation [104]


274K 


= y Re + {111} 
1412 


may be used to treat large disturbances. 


ACKNOWLEDGMENT 


The author is indebted to Mr. William I. Caldwell of the Tay- 
lor Instrument Companies, Mr. G. Forrest Drake of the Wood- 
ward Governor Company, and others for valuable suggestions 
which were used to improve the exposition of this paper. 


BIBLIOGRAPHY 


1 “Nonlinear Techniques for Improving Servo Performance,”
by Donald McDonald, National Electronics Conference, vol. 6, 1950,
pp. 400-421.

2 “Multiple Mode Operation of Servomechanisms,” by Donald
McDonald, Review of Scientific Instruments, vol. 23, no. 1, January,
1952, pp. 22-30.

3 “A Phase-Plane Approach to the Compensation of Saturating
Servomechanisms,” by A. M. Hopkin, Trans. AIEE, vol. 70, part I,
1951, pp. 631-639.

4 “A Topological and Analog Computer Study of Certain
Servomechanisms Employing Nonlinear Electronic Components,”
by R. C. Lathrop, PhD thesis, University of Wisconsin, 1951.

5 “The Stabilization of On-Off Controlled Servomechanisms,”
by A. M. Uttley and P. H. Hammond, Automatic and Manual Control,
1952 publication of the 1951 Cranfield Conference, pp. 285-307.

6 “Differential Equations With a Discontinuous Forcing Term,”
by D. W. Bushaw, Report No. 469, January, 1953, Experimental
Towing Tank, Stevens Institute of Technology, Hoboken, N. J., PhD
thesis, Dept. of Math., Princeton University, Princeton, N. J.

7 “Predictor Servomechanisms,” by L. M. Silva, Trans. IRE,
vol. CT-1, no. 1, March, 1954, pp. 56-70.

8 “Nonlinear Optimization of Relay Servomechanisms,” by
L. M. Silva, University of California Institute of Engineering
Research, series no. 60, issue no. 106, April 15, 1954.

9 “Predictor Control Optimalizes Control-System Performance,”
by L. M. Silva, Trans. ASME, vol. 77, 1955, pp. 1317-1323.

10 “An Investigation of the Switching Criteria for Higher Order
Servomechanisms,” by Irving Bogner and L. F. Kazda, Trans.
AIEE, vol. 73, part II, Applications and Industry, 1954, pp. 118-127,
Paper 54-44.

11 “Differential Equations,” by R. P. Agnew, McGraw-Hill Book
Company, Inc., New York, N. Y., 1942.

12 “Optimum Switching Criteria for Higher Order Contactor
Servo With Interrupted Circuits,” by S. S. L. Chang, AIEE 55-549.

13 “Analysis and Design Principles of Second and Higher-Order
Saturating Servomechanisms,” by R. E. Kalman, AIEE 55-551.

14 “Errors in Relay Servomechanisms,” by L. F. Kazda, Trans.
AIEE, vol. 72, part II, Applications and Industry, 1953, pp. 323-328.

15 “Effects of Friction in an Optimum Relay Servomechanism,”
by T. M. Stout, Trans. AIEE, vol. 72, part II, Applications and
Industry, 1953, pp. 329-336.


Appendix 1 


We shall consider a system with dead time given by the 
equation 


M' =e 


with 


Suppose that 
L 2 (vV2- 


If the load L is suddenly dropped at T = 0 an optimum transient
is obtained where one goes through three phases corresponding
to C' = −1, +1, 0. Before the disturbance


The change from C' = −1 to C' = +1 must occur when the
control function $\Sigma$ given by


(M’ — 2r,)? 

2 
is zero. From T = 0 to T = tr, we have M' = L and M = LT.
If L is sufficiently large, as when L = 27,4, we have $\Sigma > 0$ and
$\Sigma' < 0$ before the instant when C' changes from −1 to +1.
When 


>> = M 


we must switch to C = 0. The C-curve reaches equilibrium 
T¢ units before this happens for /. 




Corresponding results are obtained if load is suddenly taken 
on. For large enough L relative to tz we can replace the last 
term in Formula [115] by the absquare term {.M’ — 2r,} /2. 

Modifications of the theory apply if L is small relative to 4. 


Appendix 2 


We consider a system with Equation [93] subject to the
Condition [113]. Here O(D) is any polynomial with positive
coefficients. Introducing $\Psi$ as in Formula [95] we derive
Equation [96]. Consider the last phase of an optimum transient
that leads M to equilibrium. Suppose that C' = +1 for this
phase, whence C = 0 thereafter. We cannot have $\Psi = 0$ at the
start of the phase since $\Psi'' = +1$ until the end of the transient
when $\Psi$ becomes zero. Thus the shift from C' = −1 to C' = +1
in entering the last phase (before M = 0) occurs when $\Sigma$
given by


{117} 


becomes zero. Thus the absquare occurs in the last switching 
function. 

The foregoing treatment holds equally well when C' = −1
before equilibrium.


If Equation [93] is replaced by 
O(D)M’ = 
we let 
U 
whence Equation [118] goes into 
$O(D)M' = U$   [119]
where Relation [113] implies that 
£1.... 
We can write $\Psi$ for O(D)M and obtain


The argument at the beginning of this Appendix applies except 
that C’ must switch values 7, units of time before it would if 
the dead time were absent (see Appendix 1). For sufficiently 
large load rejections or acceptances the final phase before
equilibrium (i.e., the last phase for which $\Psi'' \neq 0$) is
initiated when the control function


[120] 


{121} 


becomes zero. The absquare thus occurs in the treatment of a
system with the Equation [118]. The top signs in Formula [122]
apply to load rejections and the bottom to load acceptances.


Discussion 


M. J. Nowak. The author develops the absquare control 
function, with particular application to the optimum governor 
speed control of an engine. In general the equation for this type 
of servo application is 


$O(D)M' = C$   [123a]
$|C'| \le 1$   [123b]


This general equation represents a large class of servo
applications, for which a particular interpretation is the case where M
is an engine-speed deviation and C is the position of a controlling
governor; the characteristic of the governor is that its maximum
speed is limited (e.g., this may represent the rate of opening of a
fuel valve).

* Fellow, Engineering Mechanics Division, Stanford University,
Stanford, Calif.

For this control situation the intuitive idea of having the
servo travel at maximum speed to obtain optimum transient
response is used and justified by the author to develop the absquare
control function $\Sigma$ for the case where engine time lags are
negligible so that O(D) = 1


$\Sigma = M + \tfrac{1}{2}|M'|M'$   [124]

In general a control function is a function of the process error 
and its time derivatives, whose sign determines the direction in 
which the servo is moving (at maximum speed). These required 
properties of the motion are sensed and combined into the con- 
trol function; then the times at which the control function is zero 
determine the switching points for the servo. 
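
As an illustrative sketch of how such a control function sets the switching points, the following Python fragment integrates the lag-free case M'' = C' with C' = −sgn Σ and Σ taken as the absquare function M + ½|M'|M'. The step size, tolerance, starting values, and the function names `absquare` and `simulate` are assumptions made for the illustration only; they do not come from the paper.

```python
import math

def absquare(m, m_dot):
    """Absquare control function: Sigma = M + (1/2)|M'|M'."""
    return m + 0.5 * abs(m_dot) * m_dot

def simulate(m0, m_dot0, dt=1e-4, t_max=20.0, tol=1e-3):
    """Integrate M'' = C' with C' = -sgn(Sigma) until |M| and |M'| are within tol.

    dt, t_max, and tol are illustrative choices, not values from the paper.
    Returns (elapsed time, final M, final M').
    """
    m, v, t = m0, m_dot0, 0.0
    while t < t_max and (abs(m) > tol or abs(v) > tol):
        s = absquare(m, v)
        if s != 0.0:
            u = -math.copysign(1.0, s)   # full servo speed, direction set by Sigma
        else:
            u = -math.copysign(1.0, v)   # exactly on the switching curve: brake
        m += v * dt + 0.5 * u * dt * dt  # exact update while C' is constant
        v += u * dt
        t += dt
    return t, m, v

# Assumed starting point: unit speed error, no initial acceleration.
t_end, m_end, v_end = simulate(1.0, 0.0)
print(t_end, m_end, v_end)
```

For the assumed start M = 1, M' = 0 the servo runs at C' = −1 until Σ changes sign and then at C' = +1, and the elapsed time should come out close to 2, which is the minimum for this lag-free case with |C'| ≤ 1.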

It turns out that absquare control has use not only in the simple
case analyzed by the author but also in the general Equation
[123]. This more general case will be developed briefly for
arbitrary engine time delay and arbitrary initial conditions of engine
speed. The result is that even in this general situation the
control function has the form


| af’) 


Thus the exact switching function contains corrections to the 
absquare function of first and second order in the equivalent time 
constant of the engine. 

For such step switchings involved in this type of discontinuous
control the Heaviside transformation theorem is useful; for a
differential equation which relates an output y to a step
input x


$y + a_1 y' + a_2 y'' + \ldots + a_n y^{(n)} = x$   [126a]
$O(D) = (1 + T_1 D) \ldots (1 + T_n D)$   [126b]


the usual Heaviside transformation is 
n 
———e 


(= 


= = .. [127] 


The limitation in using this formula is that it applies only to
the situation where there is no motion prior to applying the step
input, whereas in discontinuous switching control the servo
may be reversed several times under arbitrary conditions.
However, these arbitrary initial conditions can be taken into account
by using operational methods. Thus the original equation can
be integrated once by using the operator $D^{-1} = 1/D$ and applying
the initial conditions


Dr = Dy + ay + ay +... + a,y" 


—t/Te 


yO=y+2 


— (aryo + a2yo? +... + a,yo%-”) 
This process can be repeated for n-integrations to obtain 
Der = Dry + + + any 
— (a, + +... + aD 
— (a,D + a,iD* + ..[128b] 


— a 




By now applying the operator $D^n$ the result can be put in the
following form


Oy Ooyo + Ory . [129] 
) 
Ov(D) = (a, + a,D cen a,,D’ 
0,(D) = (a2 +... +a,D"-*)D 
[130] 
0,.(D) = a,,D 
n l k 
O,(D) = a,D'-* = D* | ow | 
t=k+1 1=0 


The output is now produced not only by the applied step but 
also by steps due to the initial conditions. The usual Transfor- 
mation [127] can be applied to each step to obtain the complete 
output; thus the general Heaviside transformation for arbitrary 
initial conditions is 

— 
k=1 i=0 
T, 
To facilitate application of the formula the following relations 
are useful 


[132] 


| | 
| 


—T,0, (5) — Gri 


For the servo application [123] the formula can be applied to the 
equation 


S 

z 
sl 


$O(D)M'' = C'$
where C' is a step equal to ±1


$M'' = C' - \ldots$   [133]


The transient term on the right may be denoted by $\bar{M}''$ and the
equation integrated twice with initial conditions applied


$(M + \bar{M}) = (M_0 + \bar{M}_0) + (M_0' + \bar{M}_0')t + \tfrac{1}{2}(M_0'' + \bar{M}_0'')t^2$   [134a]

$(M' + \bar{M}') = (M_0' + \bar{M}_0') + (M_0'' + \bar{M}_0'')t$   [134b]


This form of the equation illustrates the author's comment that
absquare control for the general case can be applied to the
fictitious speed $M + \bar{M}$ instead of the actual engine speed.
Moreover, this formula can be used to develop a switching
criterion for the actual engine speed in this general case.

The general method to determine the switching criterion which
leads the system to equilibrium with zero error is to find the
time $t_1$ (in terms of the initial conditions $M_0$ and $M_0'$) such that
the servo forcing can be removed (stepped to zero) and the engine
will coast to a final speed with no speed error. For a system
with no engine time lags this condition on $t_1$ is that the speed
error and acceleration are zero at $t_1$; for an engine with time
delays the condition is that $M + \bar{M}$ and $M' + \bar{M}'$ must be zero
at $t_1$. This important condition can be obtained readily from the
general Heaviside transformation applied to the final switching
to C' = 0.

It turns out that the simplified cases in which the engine time 
lags are neglected, or a single time lag is included, are insufficient 
to reveal the essential form of the switching criterion. 

In the general case an exact solution can be carried through 
in terms of two equivalent time constants 7’, 7’ of the order of the 
engine time constant, which are obtained from the general 
Heaviside transformation 


M.’ = —2C’T 


To indicate the development of these equivalent time con- 
stants consider first the cases of one and two engine time delays 
with switching points far enough apart that the engine can settle 
into its final state of forced motion at M” = C’. In these cases 
the following substitutions apply 


G + (3) | OC". ...... [136] 


Me = 2C'T(T —7)...... [135] 


One delay 
M = Mo = 2C’T:? 
T=0 
Two delays 
—2c’ 
M —T. 2 ] 
= —2C'(T; + T2) 
2C’ 
= — —t/T2 
M T,—T: 2°e ] 
Me = 2C'(T,;? + + 
= 7, 2 T, Ts 


In this method of equivalent time constants C’ is the maximum 
servo speed (+1) if the switching points are far enough apart; 
if the switching points are close together C’ may have some 
modified value obtained from Equation [136], depending on the 
previous switching. In any case Equation [134b] can be solved
for 


= (M,’ — 2C’T) [137] 


This value of $t_1$ can be substituted into Equation [134a] and the
exact equation for the switching criterion is obtained 

M’ 


+ 2TM’ —2C'T? ... . [125] 




The equivalent parameters 7’, 7, C’, which can be estimated 
from the general Heaviside transformation, actually depend on 
the time delays of the engine, the time between switching points, 
and the initial conditions of the previous switching; generally it 
may be too expensive and also unimportant to sense this com- 
plicated dependence exactly, and experimental coefficients can 
be used in the switching criterion. Furthermore, since the con- 
stant correction to the switching function is a second-order effect 
this general analysis supports the author’s practical switching 
criterion 


[138] 


It appears that if a more accurate switching criterion is warranted 
for large engine time delays a constant bias also should be in- 
cluded in the control function. 


T. M. Stout.* Many of the previous papers on optimum
nonlinear control systems, including those by the writer, have 
described theoretical or analog computer studies or, in some 
cases, tests performed on experimental instrument servomecha- 
nisms. The author must be among the first to try these 
concepts in real systems, and his paper is therefore a genuine 
contribution. 

Familiarity may breed a mistaken notion of the complexities
of a subject. The writer, nevertheless, considers the statements
“The mathematics of this theory is far beyond the limits of this
paper” and “this theory is extremely complicated” to be
unwarranted. The only permissible values of the manipulated
variable that need to be considered are +1 and −1. In second-order
systems, the corresponding response curves can be plotted
in a phase plane. It is easy to visualize what combination of 
these curves is needed to reach equilibrium from any set of initial 
conditions and to determine the necessary control or switching 
function. The resulting transients can be shown to be optimum 
by making slight alterations in the switching procedure and show- 
ing that these increase the response time. For third-order sys- 
tems, corresponding arguments can be carried out in a three- 
dimensional phase-space. This essentially graphical but per- 
fectly rigorous approach is extended with difficulty to fourth or 
high-order systems, because of our inability to draw the multi- 
dimensional figures required, but the line of reasoning is still 
applicable. The strictly analytical approach appears unduly 
cumbersome, even for the simplest systems, so that the phase- 
plane or phase-space attack seems to be advantageous in all 
cases. 
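
The perturbation argument described above can be tried numerically. The sketch below is only an illustration under assumed conditions: the lag-free second-order case M'' = C' with C' = ±1, a switching function of the assumed form M + k|M'|M', and arbitrary step size and tolerance. It compares the response time for k = 1/2 (the absquare) with slightly altered values of k.

```python
import math

def response_time(k, m0=1.0, dt=1e-4, t_max=20.0, tol=1e-3):
    """Time for M'' = C', C' = -sgn(M + k|M'|M'), to bring (M, M') within tol of rest.

    k = 0.5 is the absquare switching function; other values are deliberate
    alterations of the switching procedure. All numbers are illustrative.
    """
    m, v, t = m0, 0.0, 0.0
    while t < t_max and (abs(m) > tol or abs(v) > tol):
        s = m + k * abs(v) * v
        u = -math.copysign(1.0, s if s != 0.0 else v)
        m += v * dt + 0.5 * u * dt * dt   # exact update while C' is constant
        v += u * dt
        t += dt
    return t

times = {k: response_time(k) for k in (0.4, 0.5, 0.6)}
for k, t in times.items():
    print(k, round(t, 3))
```

Switching late (k = 0.4) produces overshoot and repeated reversals, while switching early (k = 0.6) makes the servo chatter along the switching curve; in both cases the response time should exceed that obtained with k = 0.5.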

The paper contains a number of statements and asides which 
might serve as material for several subsequent papers. The 
author’s remark, “There are initial conditions for which the 
optimum transient (in the single-lag case) is not one where 
the servo travels at full speed,’’ deserves elaboration, as does 
the special case 

$M' = \tau C' + C$


The discussion of discontinuities, two-speed systems, dead 
time, dead band, approximations to the ideal control functions, 
and the effects of multiple, sinusoidal, and other disturbances 
likewise could be expanded. A question of some importance, not 
mentioned here, is the effect of intermittent operation on the 
power rating and life of the actuating device or servo. These 
matters require investigation before practical applications of 
optimum nonlinear control systems can be made. 

“Compromises to preserve performance but reduce cost”
appear essential, and the author correctly stresses the joint
importance of performance, engineering, and economic
considerations. One cannot help wondering whether his optimism
concerning the potentialities of optimum nonlinear control is
influenced by an apparent, but possibly misleading, simplicity of
the governor problems. In the cases discussed, low-order
differential equations seem to be adequate to describe the systems,
and disturbances of interest seem to be well approximated by
step functions. Application of these control concepts to more
complicated systems, described by higher-order equations and
subjected to less specialized inputs and disturbances, seems less
promising. The author would doubtless agree that the theory
of optimum nonlinear control should serve only as a guide in the
employment of deliberate nonlinearities and should not be applied
blindly for its own sake.

* The Ramo-Wooldridge Corporation, Los Angeles, Calif.


T. J. Higgins.⁷ This paper deals with what are referred to,
at least in electrical engineering, as relay-type (or off-on)
servomechanisms. The author cites a number of the principal papers
that have been published on the analysis and design of such
systems. A substantially equal number of yet other papers on
these systems is to be found in the discusser's exhaustive
bibliography on nonlinear control systems.⁸

To these many writings on relay-type systems the paper under 
discussion comprises a most valuable addition. First, the dis- 
cusser—who has read all of the mentioned published literature 
on relay servomechanisms—found it to be very clearly written, 
which is not the case for all the earlier published papers. The 
theory is developed in considerable detail, so that all points of 
the theoretical developments are easily understood and grasped. 
Also, the author has adopted the very excellent procedure of con- 
centrating attention on several of the simpler cases and has 
examined all possible initial modes of operation of each. Such 
procedure enables the reader to gain a clear insight into the physi- 
cal actions through pertinent interpretation of the correspond- 
ing mathematical analysis. It also renders very clear (to the 
qualified reader!) the general procedure to be followed in analyz- 
ing more complicated systems such as some of those mentioned, 
but not taken up by the author, and reveals some of the essential 
physical phenomena that must be dealt with in designing non- 
linear control systems, without obscurement by heavy mathe- 
matical manipulation. 

Finally, and on the practicing side, the advance of experimental 
data in confirmation of the analytical work evidences the correct- 
ness of the author’s work and provides substantial evidence of 
the great values of analytic procedures for determining optimum 
performance of complicated nonlinear systems. 

In conclusion, the writer would extend to the author his 
sincere appreciation of the pleasure afforded him by the reading 
of this well-wrought and lucidly written paper on a difficult and 
currently interesting phase of control theory. 


AUTHOR'S CLOSURE


The author is in general agreement with the points raised by 
Mr. Nowak. The theme of Mr. Nowak’s discussion is that the 
absquare used by the author to treat systems without lags also 
applies to systems with lags. This was actually brought out in 
the paper in connection with Equation [93]. In fact, Mr. 
Nowak’s equation 


=C’.... 


obtained from Relations [133] is identical with the equation 


7 Department of Electrical Engineering, University of Wisconsin, 
Madison, Wis. 

8 A copy can be obtained by request to the Director, Engineering 
Experiment Station, Mechanical Engineering Building, University 
of Wisconsin, Madison, Wis. 


O(D)M" = C’ 


obtained from Equation [93] by differentiation, and the switching
function involving the absquare has already been derived in
the paper. This function is given by $\Sigma$ in Formula [26] when
M is replaced by $\Psi$ where


and is identical with the function $\Sigma$ in Formula [125]. The author
notes that $(M_0'' + \bar{M}_0'')$ in Equation [134a] is C', and hence is
equal to ±1. His treatment of Equation [96], like that of
Mr. Nowak, involves control on the basis of a fictitious speed
$\Psi$, i.e., $M + \bar{M}$ (denoted by M in the discussion), and after $\Psi$
attains the equilibrium condition $\Psi = 0$, the engine will coast to
the equilibrium value M = 0. The theory given in the discussion
is thus the same as that of the author except for the point
of view.

The author disagrees with the derivation of Equation [129],
which is not mathematically rigorous. Starting with
Equation [126a] by integration and differentiation Mr. Nowak derives
Equation [129], which is the same as Equation [126a] except that
terms have suddenly appeared on the right. From a given equa- 
tion one cannot derive a new equation that contains more than 
at the start. Heaviside’s work itself was lacking in rigor. This 
invalidated some of his findings, but did not detract from the 
over-all value of his work. The same thing can be said about 
the discussion under consideration. 

Dr. Stout possesses a keen and thorough knowledge of the 
field in which the paper is written, and his remarks are well 
taken. He states that only the values ±1 of the servo speed
C’ need be considered in the argument. This is true if one re- 
stricts the theory to on-off servomechanisms. Workers in the 
field generally have taken this position. If one restricts oneself 
to the on-off case it is a simple matter to establish whether or not 
a given transient is optimum in the sense of the paper. In the 
theory developed by the author it is assumed only that C’ is 
between —1 and +1. Since a class of transients, optimum in 
the sense of this paper, requires that the servo speed C’ be in 
absolute value between 0 and 1, at least during part of the tran- 
sient, it is not sufficient to consider operation at saturation only. 

Although in industry one is primarily concerned with the 
production of economical engineering designs, regardless of how 
these designs are obtained, it is nevertheless valuable to have the 
analytical treatment at hand. The analytical, that is rigorous, 
mathematical treatment of a theory is absolutely necessary for an
understanding of the applications of this theory. This is a point 
of view that the author knows Dr. Stout shares with him. 


The treatment of the case

$M' = \tau C' + C$

for a constant $\tau$ is not complicated and will be given here. It is
assumed that |C'| is bounded by 1, as in Relations [12]. Suppose
that at T = 0 we have the initial conditions

$M = M_0, \quad M' = M_0', \quad C' = 0, \quad C = M_0'$

The best transient is obtained by letting

$C' = -(\operatorname{sgn} M)$

when $M \neq 0$, and

$C' = -\frac{C}{\tau}$

when M = 0. Note that if $M_0 > 0$ and $C' = -1$ for $T > 0$
we have

$C = M_0' - T$

$M' = M_0' - \tau - T$

$M = M_0 + (M_0' - \tau)T - \tfrac{1}{2}T^2$

Now M = 0 when

$T = (M_0' - \tau) + \sqrt{(M_0' - \tau)^2 + 2M_0}$

At this value of T

$C = \tau - \sqrt{(M_0' - \tau)^2 + 2M_0}$

For the next phase we keep M = 0 by letting

$\tau C' + C = 0$

whence

$C = \{\tau - \sqrt{(M_0' - \tau)^2 + 2M_0}\}\,e^{-T/\tau}$   [141]

where we have taken T = 0 at the start of this phase. The initial
C' is then

$C' = \frac{\sqrt{(M_0' - \tau)^2 + 2M_0} - \tau}{\tau}$

The quantity C' can attain this value only if

$|\tau - \sqrt{(M_0' - \tau)^2 + 2M_0}| \le \tau$

This condition is not satisfied if

$\sqrt{(M_0' - \tau)^2 + 2M_0} > 2\tau$   [142]

If $M_0 < 0$ the Inequality [142] is replaced by

$\sqrt{(M_0' + \tau)^2 - 2M_0} > 2\tau$

Thus C can coast to equilibrium according to Equation [141]
only if $|M_0|$ and $|M_0'|$ are sufficiently small.
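
The condition for coasting can be checked with a few lines of arithmetic. The sketch below assumes the case is M' = τC' + C with |C'| ≤ 1 and M0 > 0, so that the first phase runs at C' = −1 and M = M0 + (M0' − τ)T − T²/2; the function names and the numerical values are hypothetical and serve only as an example.

```python
import math

def first_phase(m0, m0_dot, tau):
    """Phase with C' = -1 for M0 > 0 in the assumed model M' = tau*C' + C.

    Returns (time at which M reaches zero, value of C at that instant).
    """
    root = math.sqrt((m0_dot - tau) ** 2 + 2.0 * m0)
    t_switch = (m0_dot - tau) + root
    c_switch = tau - root
    return t_switch, c_switch

def m_of_t(m0, m0_dot, tau, t):
    """M during the first phase: M0 + (M0' - tau) T - T^2 / 2."""
    return m0 + (m0_dot - tau) * t - 0.5 * t * t

def can_coast(m0, m0_dot, tau):
    """True if M can then be held at zero, i.e. |C'| = |C| / tau <= 1."""
    _, c_switch = first_phase(m0, m0_dot, tau)
    return abs(c_switch) <= tau

# Illustrative values only.
t1, c1 = first_phase(0.5, 0.0, 1.0)
print(t1, c1, can_coast(0.5, 0.0, 1.0), can_coast(5.0, 0.0, 1.0))
```

With τ = 1 and M0' = 0, a small disturbance (M0 = 0.5) leaves |C| below τ when M reaches zero, so the coasting phase is admissible; a large one (M0 = 5) makes the square root exceed 2τ and the servo cannot hold M = 0 at that instant.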

Thus even though the ideal transients are not obtained by letting 
the servo travel at only full or zero speed, there exist control func- 
tions depending on M and C only that yield the ideal transients. 
In fact, these transients are obtained by letting 


C’ = 
where K is very large (infinite in the ideal case) and 




for M = 0 while 


for M = 0, provided that $\tau \neq 0$.

That the author’s enthusiasm for the nonlinear approach of 
the paper is colored by his work on governors is no doubt correct. 
Customers normally test prime-mover governors by making 
sudden load rejections or acceptances. Such sudden changes 
often occur in practice as when a generating unit is separated 
from the line and concern about the resulting transients is 
justified. However, the general principles of the paper were 
verified on a Philbrick analog computer simulating controlled 
systems of a rather general nature. These studies showed that 
the use of the absquare is beneficial for a wide range of applica- 
tions where the disturbances are not necessarily of the step 
variety. 

Dr. Stout is correct in stating that the theory serves as a 
guide. The improvement that can be obtained in practice by 
the application of the theory of the present paper was most dis- 
appointing to the author, who completed the theory as far as he 
felt it had to be carried for engineering applications, before initiat- 
ing the experimental work to verify it. Improvement in maxi- 
mum overswings of 2:1 for major disturbances without changing 
the quality of the transients is often about the best that can be 
attained where the application of the theory is indicated. In 
comparing linear and nonlinear controls one must be careful to 
employ the best control with a linear control function involving 
as many derivatives as is practical (up to the second) and arbi- 
trary coefficients and subject to arbitrary discontinuities. 

Professor Higgins is correct in his statement that the paper 
concerns on-off servomechanisms. The paper actually treats 
servomechanisms in general where the rate of change of the 
controlling variable is bounded, and this rate can be made to 
assume arbitrarily any value between its minimum and maxi- 
mum. The paper is primarily concerned with transients that 
can be made optimum by operating a servomechanism as if it 
were of the on-off variety. Interest in the field of the paper is 
indicated by the other papers on the subject that are listed in 
Professor Higgins’ superb bibliography on nonlinear control 
systems. 




On the Analysis of Linear 
and Nonlinear Systems 


By MARVIN SHINBROT,¹ MOFFETT FIELD, CALIF.


A general theory of a certain class of commonly used 
methods for the analysis of linear systems from measured 
response data will be described. It will then be shown 
that, when viewed from this general point of vantage, all 
of these linear techniques can be extended in a natural 
way to apply to nonlinear systems. In addition, through 
use of the general theory, a new method, possessing cer- 
tain advantages over those used previously, will be derived. 
Finally, the effectiveness of the new method will be illus- 
trated by several examples. 


INTRODUCTION 


This paper will be concerned with what has been called the
“inverse” problem of system analysis, i.e., with the problem of
determining the differential equations governing
the behavior of a system, given a time history of the response of
the system to some input. This problem has been of interest to
aerodynamicists for some years, and the National Advisory
Committee for Aeronautics has published several reports on the
subject. Some of this work can be found in references (1 to 4).²
Recently, workers in other fields appear to have become interested in
the problem (see, e.g., reference 5).

Only two of the preceding references deal with nonlinear sys- 
tems. The first of these (4) presents a method which, as men- 
tioned in the report itself, is not only time-consuming but rela- 
tively inaccurate. Reference (5), which also concerns itself with 
nonlinear systems, applies only to those which are “slightly’’ so. 

Last year, the National Advisory Committee for Aeronautics 
published (6). In this report, an entire class of methods 
was presented having no theoretical restrictions (such as slight 
nonlinearities, and the like) and with accuracy limited only by the 
accuracy of the experimental data themselves. Furthermore, the 
time required to apply one of the methods is no greater than that 
required by a method which has been used for years for the analy- 
sis of linear systems. The methods were obtained as a generaliza- 
tion of certain known linear techniques which were called ‘‘equa- 
tions-of-motion’’ methods in (7). The present paper is devoted 
to a somewhat heuristic account of the theory and methods of (6). 

The paper begins with a brief description of three of the most 
widely known equations-of-motion methods. This is presented 
in order to show how a general theory of such methods can 
be constructed. It is then shown how this general theory can be 
used to extend the methods to apply to nonlinear systems. Next, 
a new method, superior in certain respects to methods used here- 
tofore, is discussed briefly. Two examples are displayed to indi- 
cate the efficacy of the new method. 


1 Aeronautical Research Scientist, Ames Aeronautical Laboratory, 
National Advisory Committee for Aeronautics. 
2 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Instruments and Regulators Division of The
American Society of Mechanical Engineers and presented at
the ASME-AIEE Conference on Nonlinear Control Systems,
Princeton, N. J., March 26-28, 1956.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
2, 1956. Paper No. 56-IRD-2.


In all that follows it will be assumed that there is only one 
equation of motion of the system and that it is of the second 
order. For more general situations, see reference (6). 


SOME EQUATIONS-OF-MOTION METHODS


In reference (7), all of the known methods for linear systems
were classified, somewhat unharmoniously, either as
“response-curve-fitting” methods or as “equations-of-motion” methods.
Since the methods of the former class all involve the assumption 
that the equations of motion may be solved explicitly in terms of 
the parameters of the system, clearly it would not be possible to 
extend them to apply generally to nonlinear systems. (Milliken 
would have classed the method of reference 5 as a response-curve- 
fitting method; hence the restriction in that method to slight non- 
linearities.) As a consequence, the methods of (6) were general- 
izations of the equations-of-motion methods. 

We shall begin with a discussion of three of these methods, as 
they apply to a linear system. It is assumed that records of two 
variables x(t) and F(t) are available. From these, it is desired to 
evaluate three constants b, k, and C, such that the given data 
satisfy the equation 

$\ddot{x} + b\dot{x} + kx = CF(t)$   [1]
as closely as possible.

The first of the techniques to be discussed was given the name
of the “derivative” method by Greenberg in (1). It consists in
considering Equation [1] for fixed t as a linear equation in b, k,
and C, instead of as an equation for x. Allowing t to vary results
in a set of such equations which can be solved by least squares for
the desired constants. This process gives the equations


$b\int_0^\infty \dot{x}^2\,dt + k\int_0^\infty x\dot{x}\,dt - C\int_0^\infty \dot{x}F\,dt = -\int_0^\infty \dot{x}\ddot{x}\,dt$

$b\int_0^\infty x\dot{x}\,dt + k\int_0^\infty x^2\,dt - C\int_0^\infty xF\,dt = -\int_0^\infty x\ddot{x}\,dt$   [2]

$-b\int_0^\infty F\dot{x}\,dt - k\int_0^\infty Fx\,dt + C\int_0^\infty F^2\,dt = \int_0^\infty F\ddot{x}\,dt$


From the given record of x(t), one can find $\dot{x}(t)$ and $\ddot{x}(t)$, calculate
the integrals which are the coefficients in Equations [2], and then
solve Equations [2] for b, k, and C. It should be noted that
Equations [2] are obtained from Equation [1] by multiplying the
latter by the functions $\dot{x}(t)$, x(t), and −F(t), respectively, and
then integrating the results.
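
The derivative method can be sketched on a synthetic record. Everything in the example below is assumed for illustration: the parameter values, the test signal x(t) = t²e^(−t) (from which F(t) is constructed so that the equation of motion holds exactly), the finite record length in place of the infinite interval, the central-difference differentiation, and the helper names `det3`, `solve3`, and `derivative_method`.

```python
import math

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def solve3(a, y):
    """Solve a 3x3 linear system by Cramer's rule."""
    d = det3(a)
    return [det3([[y[i] if c == j else a[i][c] for c in range(3)]
                  for i in range(3)]) / d for j in range(3)]

def derivative_method(t, x, f):
    """Estimate (b, k, C) in x'' + b x' + k x = C F(t) from sampled records.

    The equation is multiplied by x', x, and -F in turn and integrated
    (here: summed over the record), giving three linear equations.
    """
    dt = t[1] - t[0]
    a = [[0.0] * 3 for _ in range(3)]
    y = [0.0] * 3
    for i in range(1, len(t) - 1):
        xd = (x[i + 1] - x[i - 1]) / (2.0 * dt)                 # x' from the record
        xdd = (x[i + 1] - 2.0 * x[i] + x[i - 1]) / (dt * dt)    # x'' from the record
        w = (xd, x[i], -f[i])                                   # multipliers x', x, -F
        for p in range(3):
            y[p] += w[p] * (-xdd) * dt
            for q in range(3):
                a[p][q] += w[p] * w[q] * dt
    return solve3(a, y)

# Synthetic data with assumed "true" constants.
b_true, k_true, c_true = 0.8, 4.0, 2.0
t = [i * 1e-3 for i in range(12001)]
x = [ti * ti * math.exp(-ti) for ti in t]
f = [((2 - 4 * ti + ti * ti) + b_true * (2 * ti - ti * ti) + k_true * ti * ti)
     * math.exp(-ti) / c_true for ti in t]

b_est, k_est, c_est = derivative_method(t, x, f)
print(b_est, k_est, c_est)
```

Because the synthetic record is noise-free, the estimates should agree closely with the assumed constants; with measured data the differentiation step is where noise would enter.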

The “Laplace transform” method arises when one takes the
Laplace transform of both sides of Equation [1]. Supposing for
simplicity that $x(0) = \dot{x}(0) = 0$ (this is not essential to the
method), the transformed equation is


$bs\,\xi(s) + k\,\xi(s) - C\,\varphi(s) = -s^2\xi(s)$   [3]


where $\xi(s)$ and $\varphi(s)$ are the transforms of x(t) and F(t),
respectively. These transforms can be computed for several values of s
and the resulting equations solved by least squares for b, k, and C.
Note that Equation [3] is equivalent to


$b\int_0^\infty e^{-st}\dot{x}(t)\,dt + k\int_0^\infty e^{-st}x(t)\,dt - C\int_0^\infty e^{-st}F(t)\,dt = -\int_0^\infty e^{-st}\ddot{x}(t)\,dt$   [4]


since integrating by parts the appropriate integrals of Equation
[4] gives


$bs\int_0^\infty e^{-st}x(t)\,dt + k\int_0^\infty e^{-st}x(t)\,dt - C\int_0^\infty e^{-st}F(t)\,dt = -s^2\int_0^\infty e^{-st}x(t)\,dt$


which is just Equation [3] with the functions $\xi(s)$ and $\varphi(s)$
replaced by their definitions in terms of the functions x(t) and F(t).
Looking at Equation [4], it can be seen that Equation [3] is
obtained by multiplying Equation [1] by $e^{-st}$ and integrating. In
the Laplace transform method, these integrals are computed for
a certain number of values of s and the resulting Equations [3]
solved by least squares for the desired constants.
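
A corresponding sketch of the Laplace transform method follows. It assumes zero initial conditions x(0) = x'(0) = 0, a record long enough for the response to die out, trapezoidal integration, and five arbitrarily chosen values of s; the test signal, the constants, and the helper names are again hypothetical. Note that no differentiation of the record is needed here, because the integration by parts has moved the derivatives onto the exponential.

```python
import math

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def solve3(a, y):
    """Solve a 3x3 linear system by Cramer's rule."""
    d = det3(a)
    return [det3([[y[i] if c == j else a[i][c] for c in range(3)]
                  for i in range(3)]) / d for j in range(3)]

def laplace(t, g, s):
    """Trapezoidal approximation of the integral of exp(-s t) g(t) over the record."""
    dt = t[1] - t[0]
    vals = [math.exp(-s * ti) * gi for ti, gi in zip(t, g)]
    return dt * (sum(vals) - 0.5 * (vals[0] + vals[-1]))

def laplace_method(t, x, f, s_values):
    """Least-squares solution of b s xi + k xi - C phi = -s^2 xi over several s."""
    a = [[0.0] * 3 for _ in range(3)]
    y = [0.0] * 3
    for s in s_values:
        xi, phi = laplace(t, x, s), laplace(t, f, s)
        row = (s * xi, xi, -phi)          # coefficients of b, k, C
        rhs = -s * s * xi
        for p in range(3):                # accumulate the normal equations
            y[p] += row[p] * rhs
            for q in range(3):
                a[p][q] += row[p] * row[q]
    return solve3(a, y)

# Synthetic data with assumed "true" constants and x(0) = x'(0) = 0.
b_true, k_true, c_true = 0.8, 4.0, 2.0
t = [i * 1e-3 for i in range(30001)]
x = [ti * ti * math.exp(-ti) for ti in t]
f = [((2 - 4 * ti + ti * ti) + b_true * (2 * ti - ti * ti) + k_true * ti * ti)
     * math.exp(-ti) / c_true for ti in t]

b_est, k_est, c_est = laplace_method(t, x, f, [0.5, 1.0, 1.5, 2.0, 3.0])
print(b_est, k_est, c_est)
```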

The last method to be discussed is the so-called ‘‘Fourier trans- 
form’’ method. It is essentially the same as the Laplace 
transform method except that the Fourier transform replaces the 
Laplace transform. 

For more details of these methods, see reference (1). 


GENERAL THEORY FOR LINEAR SYSTEMS


In order for the general theory to evolve, it is helpful to forget 
the ideas behind each of the methods described in the preceding 
section and to keep firmly in mind the computational procedure 
which leads to each method. Each of the methods considered 
can be described in these terms as follows: 

The derivative method arises when 


(a) Equation [1] is multiplied by the three functions $\dot{x}(t)$, x(t),
and −F(t) one at a time.

(b) The resulting equations are integrated from zero to infinity.

(c) The equations to which step (b) leads are solved for b,
k, and C.

We say the Laplace transform method is being applied when 


(a) Equation [1] is multiplied by $N$ ($\geq 3$) functions $e^{-s_n t}$.

(b) The resulting equations are integrated from zero to infinity.

(c) The equations to which step (b) leads are solved (by least
squares since $N$ is usually chosen greater than 3) for $b$, $k$, and $C$.


The Fourier transform method results when, in the description
of the Laplace transform method, the functions $e^{-s_n t}$ are replaced
by $e^{-i\omega_n t}$ (or, alternatively, by $\cos \omega_n t$ and $\sin \omega_n t$).

That there is a general feature underlying the three methods
which have been discussed is now clear. In each of them the
same steps are performed: multiplication, integration, and solution
of the resulting equations. Quite generally, then, one can
develop equations-of-motion methods by selecting a set of functions
$y_n(t)$ (called the “method functions”), $n = 1, \ldots, N$ for
some $N \geq 3$, and then


(a) Multiplying Equation [1] by each of these functions. 
(b) Integrating the resulting equations. 
(c) Solving these new equations for the desired parameters. 


In the three methods described, the integration of step (b) proceeds
over the interval $(0, \infty)$; as will be seen, this is not essential
and, indeed, the fact that it is inessential can be used to improve
existing methods noticeably. To avoid some complications
initially, we shall continue to integrate over the infinite interval;
this restriction will be removed subsequently.


It should be noted that reference to a difference which exists in 
step (b) between the derivative method and the other two methods 
has been suppressed in order to get the ideas across better. This 
difference lies in the fact that in step (b) of the Laplace and 
Fourier transform methods, an integration by parts is performed. 
We shall discuss this again in the sequel. 

The process described in the preceding paragraph leads, after
step (b), to $N$ equations of the form

$$b\int_0^\infty y_n\dot{x}\,dt + k\int_0^\infty y_n x\,dt - C\int_0^\infty y_n F\,dt = -\int_0^\infty y_n\ddot{x}\,dt, \qquad n = 1, \ldots, N \qquad [5]$$

It is possible that the functions $y_n(t)$ depend on the experimentally
determined functions $x(t)$ and $F(t)$; this is the case for the
derivative method, in which $y_1(t) = \dot{x}(t)$, $y_2(t) = x(t)$, $y_3(t) = -F(t)$.
If this is so, Equations [5] can be considered as $N$
equations which are to be solved (by least squares if $N > 3$) for
the desired parameters. Of course, this process requires the (inherently
inaccurate) calculation of the derivatives $\dot{x}(t)$ and $\ddot{x}(t)$
from the experimental data. If it is desired to eliminate such a
step, the $y_n(t)$ must be chosen as explicitly independent of $x(t)$
and $F(t)$, as in the case of the Laplace or the Fourier transform
methods. If this is the case, the following formulas, obtained by
integration by parts, are used


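$$\int_0^\infty y_n\dot{x}\,dt = -y_n(0)x(0) - \int_0^\infty \dot{y}_n x\,dt$$

$$\int_0^\infty y_n\ddot{x}\,dt = -y_n(0)\dot{x}(0) + \dot{y}_n(0)x(0) + \int_0^\infty \ddot{y}_n x\,dt$$

Substitution into Equations [5] gives

$$-b\left[y_n(0)x(0) + \int_0^\infty \dot{y}_n x\,dt\right] + k\int_0^\infty y_n x\,dt - C\int_0^\infty y_n F\,dt = y_n(0)\dot{x}(0) - \dot{y}_n(0)x(0) - \int_0^\infty \ddot{y}_n x\,dt, \qquad n = 1, \ldots, N \qquad [6]$$

Equations [6] are $N$ equations in $b$, $k$, and $C$. If $N > 3$, they can
be solved by least squares for those parameters.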


GENERALIZATION TO NONLINEAR SYSTEMS 


We have indicated how a general theory of equations-of-motion 
methods for linear systems is developed. We now proceed to 
consider nonlinear systems, in particular systems subject to the 
equation 


$$\ddot{x} + f(x)\dot{x} + g(x) = 0 \qquad [7]$$


It should be said that it certainly is possible to treat more general 
situations, in particular situations in which there is a forcing 
term on the right side of the equation, but Equation [7] suits ad- 
mirably the needs of exposition. 

In order to apply the method, $f(x)$ and $g(x)$ are approximated by
polynomials. Although more complicated polynomials can be
considered, we shall restrict ourselves here to the case where

$$f(x) = b_0 + b_1 x + b_2 x^2, \qquad g(x) = k_0 x + k_1 x^2 + k_2 x^3 \qquad [8]$$

There are no conceptual difficulties in the generalization to higher-order
polynomials. The problem can now be stated as follows:
Given an experimental record of $x(t)$, determine six constants $b_i$
and $k_i$ such that $x(t)$ satisfies Equation [7] as closely as possible,
with $f$ and $g$ being given by Equation [8].

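Manifestly, it is possible to generalize the method of the preceding
section to solve this problem. Select $N$ method functions
$y_n(t)$. Multiply Equation [7] by these functions and integrate.
Integration by parts then gives

$$-b_0\left[y_n(0)x(0) + \int_0^\infty \dot{y}_n x\,dt\right] - \frac{1}{2}b_1\left[y_n(0)x^2(0) + \int_0^\infty \dot{y}_n x^2\,dt\right] - \frac{1}{3}b_2\left[y_n(0)x^3(0) + \int_0^\infty \dot{y}_n x^3\,dt\right]$$
$$+\; k_0\int_0^\infty y_n x\,dt + k_1\int_0^\infty y_n x^2\,dt + k_2\int_0^\infty y_n x^3\,dt = y_n(0)\dot{x}(0) - \dot{y}_n(0)x(0) - \int_0^\infty \ddot{y}_n x\,dt, \qquad n = 1, \ldots, N \qquad [9]$$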


The integrals occurring in Equations [9] can be evaluated. 
Following this, Equations [9] can be solved for the desired con- 
stants. 


CHOICE OF THE METHOD FUNCTIONS


To this point in the discussion, the method functions $y_n(t)$ have
been to a great extent arbitrary, having to satisfy only certain
weak smoothness conditions so that the integrations by parts
which led to Equations [6] and [9] were allowable. In this section,
a few comments will be made on how this arbitrary character
of the method functions can be utilized to devise new techniques,
superior in point of accuracy to methods heretofore used.

With the exception of the derivative method, to which the following
remarks do not apply (but which has its own troubles), any
equations-of-motion method not carefully selected will be prone
to certain unpleasant features. Upon inspection of Equation [6] or
[9], the first thing which strikes the eye is the peculiar role
played by the initial point $t = 0$. It is clear that a small change
in the initial values can produce a sizable change in the calculated
values of parameters while this is true of no other point. This
feature is clearly undesirable. The cure can be seen from either of
Equations [6] or [9] to be to choose the method functions such
that


$$y_n(0) = \dot{y}_n(0) = 0, \qquad n = 1, \ldots, N \qquad [10]$$


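Choosing method functions with this property eliminates the
heavy dependence on the initial conditions³ and reduces Equation
[9] to the following

$$-b_0\int_0^\infty \dot{y}_n x\,dt - \frac{1}{2}b_1\int_0^\infty \dot{y}_n x^2\,dt - \frac{1}{3}b_2\int_0^\infty \dot{y}_n x^3\,dt + k_0\int_0^\infty y_n x\,dt + k_1\int_0^\infty y_n x^2\,dt + k_2\int_0^\infty y_n x^3\,dt$$
$$= -\int_0^\infty \ddot{y}_n x\,dt, \qquad n = 1, \ldots, N \qquad [11]$$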


3 The Condition [10] would be replaced by the condition
$y_n^{(k)}(0) = 0$, $k = 0, 1, \ldots, K - 1$,
if $K$ is the highest derivative occurring in the basic equation assumed
to describe the motion.


Another objection to the methods discussed so far is the infinite
interval of integration which occurs in Equation [11].
Naturally, no experimental data can ever be obtained over an infinite
interval of time. Consequently, it would be desirable if the
infinite upper limit on the integrals occurring in Equation [11]
could be replaced by a finite limit. Suppose the data available to
be $T$ seconds long. We cannot blindly replace $\infty$ by $T$ in Equation [11],
for the by-parts integrations which led to Equation [11]
would give rise to terms, analogous to the terms eliminated by
the Condition [10], depending on the final conditions when $t = T$.
These terms cause the end conditions to be weighted unduly.
They can be eliminated by adjoining to Conditions [10] conditions
of the form


$$y_n(T) = \dot{y}_n(T) = 0, \qquad n = 1, \ldots, N \qquad [12]$$
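If Conditions [10] and [12] are assumed, the basic Equation [11]
becomes

$$-b_0\int_0^T \dot{y}_n x\,dt - \frac{1}{2}b_1\int_0^T \dot{y}_n x^2\,dt - \frac{1}{3}b_2\int_0^T \dot{y}_n x^3\,dt + k_0\int_0^T y_n x\,dt + k_1\int_0^T y_n x^2\,dt + k_2\int_0^T y_n x^3\,dt$$
$$= -\int_0^T \ddot{y}_n x\,dt, \qquad n = 1, \ldots, N \qquad [13]$$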


The method functions are still quite arbitrary; they only must
satisfy Equations [10] and [12] in order that [13] follow. In
order that all the data be given equal weight, the further condition
that the method functions do not consistently approach zero somewhere
may be imposed. (This eliminates the Laplace transform
method and, for stable systems in which the output approaches
zero, the derivative method.) In (6), method functions of the
form


$$y_n(t) = \sin^2 \omega_n t, \qquad n = 1, \ldots, N \qquad [14]$$


were considered. Naturally, these functions, while satisfying
Equation [10], will not in general satisfy [12] unless the frequencies
$\omega_n$ are properly chosen. If the $\omega_n$ are chosen according to the
rule

$$\omega_n = \frac{n\pi}{T}, \qquad n = 1, \ldots, N \qquad [15]$$

Conditions [10] and [12] will both be satisfied. This choice of frequencies
leads to an elegant method which gives satisfactory
results in certain cases. On the other hand, the difference ($\pi/T$)
between two successive frequencies is too large to define the “frequency
response” (to use loosely the terminology of the Fourier
transform) of some examples adequately unless $T$ is quite large.
Hence, in (6), more frequencies were inserted between the frequencies [15].
In order that [12] continue to be satisfied, the
method functions were chosen of the form of Equation [14], except
that after the last zero of [14] in the interval $0 < t < T$, the function
was cut off, so that some of the method functions were
identically zero over a small part of the basic interval. For further
details of this method, the reader is referred to (6). The
efficacy of this choice of method functions is illustrated by the
examples to follow.
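A minimal numerical sketch of the finite-interval method may help fix ideas. It is an illustration under stated assumptions, not the computation of reference (6): the method functions are taken as $y_n(t) = \sin^2 \omega_n t$ with $\omega_n = n\pi/T$ (so that $y_n$ and $\dot{y}_n$ vanish at both ends of the record), the damping is taken as a constant $b$, and the "true" parameters, initial conditions, and step size are hypothetical values used only to manufacture a record.

```python
import numpy as np

# Hypothetical "true" system: x'' + b x' + k0 x + k1 x^2 + k2 x^3 = 0
TRUE = np.array([2.0, 50.0, -30.0, 800.0])   # b, k0, k1, k2 (assumed values)


def simulate(x0=0.1, v0=0.0, dt=1e-4, t_end=2.0):
    """Manufacture a noise-free record x(t) with classical RK4."""
    b, k0, k1, k2 = TRUE

    def acc(x, v):
        return -b * v - (k0 * x + k1 * x ** 2 + k2 * x ** 3)

    n = int(round(t_end / dt))
    t = np.linspace(0.0, t_end, n + 1)
    xs = np.empty(n + 1)
    x, v = x0, v0
    xs[0] = x
    for i in range(n):
        s1x, s1v = v, acc(x, v)
        s2x, s2v = v + 0.5 * dt * s1v, acc(x + 0.5 * dt * s1x, v + 0.5 * dt * s1v)
        s3x, s3v = v + 0.5 * dt * s2v, acc(x + 0.5 * dt * s2x, v + 0.5 * dt * s2v)
        s4x, s4v = v + dt * s3v, acc(x + dt * s3x, v + dt * s3v)
        x += dt * (s1x + 2 * s2x + 2 * s3x + s4x) / 6.0
        v += dt * (s1v + 2 * s2v + 2 * s3v + s4v) / 6.0
        xs[i + 1] = x
    return t, xs


def trapezoid(f, dt):
    """Trapezoidal rule on a uniform grid."""
    return dt * (f.sum() - 0.5 * (f[0] + f[-1]))


def fit(t, x, n_freq=8):
    """One integrated equation per method function, solved by least squares."""
    T = t[-1] - t[0]
    dt = t[1] - t[0]
    rows, rhs = [], []
    for n in range(1, n_freq + 1):
        w = n * np.pi / T
        y = np.sin(w * t) ** 2                  # y_n
        yd = w * np.sin(2.0 * w * t)            # first derivative of y_n
        ydd = 2.0 * w ** 2 * np.cos(2.0 * w * t)  # second derivative of y_n
        rows.append([-trapezoid(yd * x, dt),      # coefficient of b
                     trapezoid(y * x, dt),        # coefficient of k0
                     trapezoid(y * x ** 2, dt),   # coefficient of k1
                     trapezoid(y * x ** 3, dt)])  # coefficient of k2
        rhs.append(-trapezoid(ydd * x, dt))
    est, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    return est


t, x = simulate()
estimate = fit(t, x)
print(estimate)
```

Note that only $x(t)$ itself enters the integrals; no derivative of the record is ever computed, and neither the initial nor the final values of the record carry special weight.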


EXAMPLES 


Two examples will be given. The first is that of a slightly non- 
linear hardening spring and the second that of the van der Pol 
oscillator. 

For the first example the equation which it is assumed the data
satisfy is

$$\ddot{x} + b\dot{x} + g(x) = 0 \qquad [16]$$


For the example under consideration, $b$ was chosen equal to 2 and
$g(x)$ was chosen as the solid curve in Fig. 1.⁴ The “measured”
data were manufactured by solving Equation [16] on a REAC.
These data are displayed in Fig. 2. The method described was
then applied to these data, assuming the form

$$g(x) = x(k_0 + k_1 x + k_2 x^2) \qquad [17]$$

for $g$. This procedure gave the results

$$b = 1.95, \qquad k_0 = 50.4, \qquad k_1 = -30.5, \qquad k_2 = 806$$

Since the starting value of $b$ was 2, we see that $b$ has been found
with an error of 2.5 per cent. The values of $k_0$, $k_1$, $k_2$ were substituted
into Equation [17] and the resulting function was
plotted as the dotted curve in Fig. 1. It can be seen that the
error at the least accurate point is less than 3 per cent.

4 A multiplicative factor was omitted when Fig. 1 was plotted.
In order to correct the figure, the vertical scale should be multiplied
by -0.01. Note that the minus sign on this factor effectively inverts
the figure, thus making the system stable, which is as it should be.

Fig. 1 Nonlinear Spring of Example 1 (true and calculated curves plotted against x)

For the second example, the equation

$$\ddot{x} + \cdots + x = 0$$

was solved on the REAC giving the “data” of Fig. 3. It was pretended
that no more was known about the equation satisfied by
the data than that it was of the form

$$\ddot{x} + (b_0 + b_1 x + b_2 x^2)\dot{x} + kx = 0$$

and the method was applied to this equation. This gave the results

$$b_0 = -10.0, \qquad b_1 = -0.14, \qquad b_2 = 9.42, \qquad k = 0.96$$

which clearly represents a good fit to the data.

Fig. 2 Data for Example 1 (plotted against time, t, sec)

Fig. 3 Data for Example 2 (plotted against time, t, sec)


CONCLUDING REMARKS


A general theory of the so-called equations-of-motion
methods for the analysis of dynamical systems
has been presented. It has been shown that,
when looked at from a new point of view, all such
methods can be generalized so as to apply to linear
and nonlinear systems alike. Use of this theory
also has shown how new methods can be developed
to satisfy the requirements of particular problems.

One new method has been described in some detail. In certain
cases, it reduces to the well-known Fourier transform method
but in all cases has certain advantages over this latter
method and over methods heretofore used. Its superiority is
based on two facts: (a) Most of the previously known
equations-of-motion methods depend heavily on the initial
conditions; this dependence is entirely eliminated in the new
method. (b) Most of the methods used to this time demand an
infinitely long record for their rigorous application;
this demand is not made by the new method.

Finally, it also might be mentioned that the time of
application of the method is no greater than that for existing
methods and that the method is well suited to machine computation.


BIBLIOGRAPHY 


1 “A Survey of Methods for Determining Stability Parameters of 
an Airplane From Dynamic Flight Measurements,” by Harry Green- 
berg, NACA TN 2340, 1951. 

2 “A Least Squares Curve Fitting Method With Applications to
the Calculation of Stability Coefficients From Transient-Response
Data,” by Marvin Shinbrot, NACA TN 2341, 1951.

3 “A Description and a Comparison of Certain Nonlinear Curve 
Fitting Techniques, With Applications to the Analysis of Transient- 
Response Data,” by Marvin Shinbrot, NACA TN 2622, 1952. 

4 “Techniques for Calculating Parameters of Nonlinear Dynamic 
Systems From Response Data,” by B. R. Briggs and A. L. Jones, 
NACA TN 2977, 1953. 

5 “The Attenuation of Damped Free Vibrations and the Deriva- 
tion of the Damping Law From Recorded Data,” by K. Klotter, 
Stanford University Division of Engineering Mechanics, Contract 
N6-ONR-251, Technical Report 23, November 1, 1953. 

6 “On the Analysis of Linear and Nonlinear Dynamical Systems
From Transient-Response Data,” by Marvin Shinbrot, NACA TN
3288, 1954.

7 “Dynamic Stability and Control Research,” by W. F. Milliken,
Jr., presented at Third International Joint Conference of RAS-IAS,
Brighton, England, September 3-14, 1951, Cornell Aero. Laboratory,
Inc., CAL-39; also issued in Anglo-American Aero. Conf. Report,
1952, pp. 447-524.


Discussion

E. S. Smith.⁵ The author's method for handling nonlinearities
appears to be new and of wide usefulness. However, the method
is of such generality that a short application to a typical control
is needed in the closure for the paper to be of most use to its
readers.


5 Ordnance Engineer, Ballistic Research Laboratories, Aberdeen
Proving Ground, Md. Fellow ASME.


AUTHOR'S CLOSURE


In response to Mr. Smith’s suggestion, the first example given 
in the paper will be worked out in greater detail. The data of 
Fig. 2 are given in Table 1. 


TABLE 1 




In accordance with Equation [14] the method functions were
chosen of the form


$$y_n(t) = \sin^2 \omega_n t \qquad [18]$$


over the first part of the interval, and as zero over the part of the
interval after the last zero of $y_n$ in $0 < t < 2$, as was discussed
following Equation [15].


TABLE 2 

8 9 10 11 

Qn 52 25% 
2 


12 


The frequencies used in Equation [18] are given in Table 2.
The frequency $\omega_1$ is not entered in the table since $y_1 = 0$ and so
offers no information.

For this example, Equation [13] becomes


$$-b\int_0^T \dot{y}_n x\,dt + k_0\int_0^T y_n x\,dt + k_1\int_0^T y_n x^2\,dt + k_2\int_0^T y_n x^3\,dt = -\int_0^T \ddot{y}_n x\,dt, \qquad n = 2, \ldots, 16 \qquad [19]$$


From Table 1, the quantities $x^2$ and $x^3$ can be found, and then the
integrals occurring in Equation [19] can be computed.⁶ This
computation gives the results displayed in Table 3.

Incidentally, it can be seen that the entries in Table 3 have 
from four to six significant figures, while the original data of 
Table 1 have fewer. Naturally, then, not all the figures of Table 
3 have meaning, but they have been carried through the compu- 
tation and roundoff is only performed at the end. 

With parenthesized numbers referring to columns in Table 3, it 
can be seen that Equations [19] are equivalent to 


$$(2)\,b + (3)\,k_0 + (4)\,k_1 + (5)\,k_2 = (6), \qquad n = 2, \ldots, 16$$


Hence, if these equations are to be solved by least squares, one 
must solve the Equations 


6 Simpson's rule of course can be used for this computation, but
the method of L. N. G. Filon is better suited to it. For this method,
see “On a Quadrature Formula for Trigonometric Integrals,” by L.
N. G. Filon, Proceedings of the Royal Society of Edinburgh, vol. 49,
1928-1929, pp. 38-47; “Integral Transforms in Mathematical Physics,”
by C. J. Tranter, John Wiley & Sons, Inc., New York, N. Y.,
1951. The method is also described in an appendix to reference (6)
of the present paper where tables of integration coefficients are given.


t z(t) x(t) 
—0 0503 
0455 —0 0285 
0833 —0.0232 
1070 0152 
1140 —0 0065 
1053 0.0025 
0835 0.0100 
0530 0.0158 
0188 0.0188 
0148 0.0195 
0.0180 
Se 3x 30% 10r Tr , 


TABLE 3

(1)   (2)             (3)            (4)              (5)              (6)
n     -∫ ẏ_n x dt     ∫ y_n x dt     ∫ y_n x² dt      ∫ y_n x³ dt      -∫ ÿ_n x dt

 2    -0.007550       -0.001075      0.0016715        -0.00000265      -0.107630
 3    -0.025990       -0.006730      0.0018970        0. 2             -0.442175
 4    -0.124265       -0.009360      0.0021645        0.00003525       -0.757450
 5    -0.201610        0.015110      0.0022380        0.00011815        0.392470
 6    -0.049340        0.022210      0.0024005        0.00017315        1.099785
 7    -0.027140        0.013860      0.0024895        0.00018030        0.721525
 8    -0.000660        0.015030      0.0028400        0.00017660        0.823325
 9     0               0.011240      0.0033385        0.00017030        0.608745
10     0.005105        0.012640      0.0032335        0.00016535        0.693960
11     0.002965        920           0.0026770        0.00016080        0.552885
12     0.007775        0.011685      0.0025320        0.00015970        0.659540
13     0.006545        0.009275      0.0023670        0.00014355        0.514865
14     0.010500        0.011040      0.0023815        0.00012490        0.586380
15     0.010765        0.009805      0.0023400        0.00010480        0.500355
16     0.007790        0.010705      0.0023440        0.00009305        0.554280


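Σ(2) × (2) = 0.06041725
Σ(2) × (3) = -0.00263175
Σ(2) × (4) = -0.000841900
Σ(2) × (5) = -0.0000350735
Σ(2) × (6) = -0.0175448
Σ(3) × (3) = 0.00221335
Σ(3) × (4) = 0.000360318
Σ(3) × (5) = 0.0000226548
Σ(3) × (6) = 0.113807
Σ(4) × (4) = 0.0000935345
Σ(4) × (5) = 0.00000474140
Σ(4) × (6) = 0.0175061
Σ(5) × (5) = 0.000000272438
Σ(5) × (6) = 0.00114957


$$b\,\Sigma(2)\times(2) + k_0\,\Sigma(2)\times(3) + k_1\,\Sigma(2)\times(4) + k_2\,\Sigma(2)\times(5) = \Sigma(2)\times(6)$$
$$b\,\Sigma(3)\times(2) + k_0\,\Sigma(3)\times(3) + k_1\,\Sigma(3)\times(4) + k_2\,\Sigma(3)\times(5) = \Sigma(3)\times(6)$$
$$b\,\Sigma(4)\times(2) + k_0\,\Sigma(4)\times(3) + k_1\,\Sigma(4)\times(4) + k_2\,\Sigma(4)\times(5) = \Sigma(4)\times(6)$$
$$b\,\Sigma(5)\times(2) + k_0\,\Sigma(5)\times(3) + k_1\,\Sigma(5)\times(4) + k_2\,\Sigma(5)\times(5) = \Sigma(5)\times(6) \qquad [20]$$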


The values of the sums occurring here are given below Table 3. 
Substitution of these sums into the appropriate places in Equa- 
tions [20] and solution of the resulting equations gives the values 
of the parameters quoted in the body of the paper. 
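The final step, solving the four normal equations for $b$, $k_0$, $k_1$, and $k_2$, can be sketched as follows. This is only an illustration: the coefficients are the sums of column products listed with Table 3, the sum Σ(2) × (6) is taken as -0.0175448 (the value obtained by recomputing it from columns (2) and (6) of Table 3, which is an assumption of this sketch), and NumPy stands in for whatever desk or machine computation was actually used. The system is poorly conditioned, so the printed values should be read as approximate.

```python
import numpy as np

# Symmetric matrix of sums of products of columns (2)-(5) of Table 3.
A = np.array([
    [0.06041725,    -0.00263175,   -0.000841900,   -0.0000350735],
    [-0.00263175,    0.00221335,    0.000360318,    0.0000226548],
    [-0.000841900,   0.000360318,   0.0000935345,   0.00000474140],
    [-0.0000350735,  0.0000226548,  0.00000474140,  0.000000272438],
])

# Right-hand sides: sums of products of columns (2)-(5) with column (6).
# The first entry is the recomputed value noted in the text above (assumption).
rhs = np.array([-0.0175448, 0.113807, 0.0175061, 0.00114957])

params = np.linalg.solve(A, rhs)
b, k0, k1, k2 = params
print("b  =", b)
print("k0 =", k0)
print("k1 =", k1)
print("k2 =", k2)
```

Because the sums are rounded, the solution may differ somewhat in its last figures from the values quoted in the body of the paper.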

More examples with even further detail are given in reference 
(6). 

The author of (5) has asked that the following be mentioned
as a more readily available source of his article.⁷

7 “The Attenuation of Damped Free Vibrations and the Derivation
of the Damping Law From Recorded Data,” by K. Klotter, Proceedings
of the Second U. S. National Congress of Applied Mechanics,
1954.


Physical and Mathematical Mechanisms of 
Instability in Nonlinear Automatic 
Control Systems 


By R. E. KALMAN,¹ NEW YORK, N. Y.


This paper is a critical examination of the stability prob- 
lem in automatic control systems containing nonlinear 
elements. An attempt is made to classify and isolate 
essentially different phenomena, and to illustrate each 
type by means of simplified but representative examples. 
Particular attention is paid to the effects of system param- 
eters and system inputs in provoking or destroying in- 
stability. The phenomena discussed are so diverse that 
they defy any over-all conclusions. Still, it is hoped that 
the tentative classification adopted here will be of help in 
recognizing important and unimportant aspects of prob- 
lems in nonlinear control-system design. 


GLOSSARY OF SPECIAL TERMS


Nonlinear Differential Equation. The set of relations

$$\frac{dx_i}{dt} = f_i(x_1, \ldots, x_n) \qquad (i = 1, \ldots, n) \qquad (*)$$

where $f_i$ is some function of the variables $x_1, \ldots, x_n$; $f_i$ must be
specified in each case.

Phase Space. An n-dimensional Euclidean space where the
Cartesian co-ordinates of a point are the $x_1, \ldots, x_n$, also called
“phase-space variables.” Physically, the $x_1, \ldots, x_n$ determine
n different stored energies in a lumped-parameter system. It is
also said that the $x_1, \ldots, x_n$ represent the “initial conditions” of
the system. The phase space is the set of all possible initial conditions
or “states” of a dynamic system.

Trajectory. A curve in the phase space which is specified in
parametric form by a set of transient solutions of (*): $x_1(t), \ldots, x_n(t)$;
it starts at the point $x_1(t = 0), \ldots, x_n(t = 0)$. The phrase,
“the trajectories move” (or “the trajectories tend to ...”)
refers to the paths described by the trajectories as $t$ changes
monotonically (usually $t \to +\infty$).

Limiting Behavior. Description of motion of trajectories as
$t \to \pm\infty$.

Critical Point. A point in phase space where all the time
derivatives $\dot{x}_1(t), \ldots, \dot{x}_n(t)$ vanish simultaneously; sometimes
called a “singular point.”

Equilibrium Point = critical point—a point where a dynamic 
system is at rest. 

Limit Cycle. A nonself-intersecting closed curve in the phase
space which is a trajectory. It is “stable” if trajectories in its
neighborhood tend to it as $t \to +\infty$; it is “unstable” if trajectories
in its neighborhood tend to it as $t \to -\infty$. In the time domain
the limit cycle corresponds to a periodic function.


1 Department of Electrical Engineering and Electronics Research
Laboratories, Columbia University.

Contributed by the Instruments and Regulators Division of The
American Society of Mechanical Engineers and presented at the
ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters,
February 1, 1956. Paper No. 56-IRD-16.


Types of Critical Points. Critical points may be classified with 
the help of the characteristic roots of the linear differential equa- 
tion with constant coefficients which govern the motion of trajec- 
tories near the critical point. 

(a) Node. The characteristic roots are real and of the same 
sign. 

(b) Focus. The characteristic roots are complex conjugate. 

(c) Saddle Point. The characteristic roots are real and of 
opposite sign. 

NOTE: This terminology applies only to critical points in
two-dimensional phase space.

Stability of Critical Points. If the real part of each of the characteristic
roots of the linear differential equation associated with
the critical point is negative, the critical point is stable; if the
real parts are all non-negative, the critical point is unstable. A
saddle point is neither stable nor unstable.
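The classification given in the glossary can be restated as a short routine. The sketch below is only an illustration of the definitions above for a two-dimensional phase space; the function name, the tolerance, and the example matrices are hypothetical, and the matrix `A` is assumed to hold the constant coefficients of the linear equation governing motion near the critical point.

```python
import numpy as np


def classify_critical_point(A, tol=1e-12):
    """Classify a critical point from the characteristic roots of a 2x2 matrix."""
    roots = np.linalg.eigvals(np.asarray(A, dtype=float))
    re, im = roots.real, roots.imag

    if np.any(np.abs(im) > tol):
        kind = "focus"            # complex conjugate roots
    elif re[0] * re[1] < 0:
        kind = "saddle point"     # real roots of opposite sign
    else:
        kind = "node"             # real roots of the same sign

    if kind == "saddle point":
        stability = "neither stable nor unstable"
    elif np.all(re < 0):
        stability = "stable"
    else:
        stability = "unstable"    # real parts all non-negative
    return kind, stability


# Example matrices (hypothetical), written for x1' = x2, x2' = -c x1 - d x2.
print(classify_critical_point([[0, 1], [-2, -3]]))   # roots -1, -2
print(classify_critical_point([[0, 1], [-5, -2]]))   # roots -1 +/- 2i
print(classify_critical_point([[0, 1], [1, 0]]))     # roots +1, -1
```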

The terminology defined in the foregoing is standard throughout
the majority of the literature of nonlinear differential equations
and nonlinear mechanics. For additional information
consult references (1 to 4).²


1 INTRODUCTION 


This paper is concerned with automatic control systems by 
which are meant servomechanisms, regulators, analog com- 
puters, gun-directors, autopilots, etc. Such systems may be 
described by the adjectives active, deterministic, dynamic. 
The class of systems treated in the paper is required to be, in 
addition, also time-invariant and lumped-parameter. Under 
such restrictions, the behavior of a system over time will be 
governed by an autonomous ordinary nonlinear differential 
equation. 

The paper itself might be called a study of nonlinear differential 
equations. While correct, such a viewpoint would be very arti- 
ficial, since all of the discussion which follows is motivated by 
practical engineering problems. To be sure, it will be unavoid- 
able to deal with such problems in a somewhat abstract way, 
assuming the reader’s familiarity with equipment used in the 
automatic control art. 

A nonlinear system is defined by the fact that the principle of 
superposition does not apply. In other words, a linear com- 
bination of two solutions of a nonlinear differential equation is 
not, in general, a solution. 

Further qualifications are necessary to arrive at the heart of
the problem. As a consequence of the principle of superposition,
the response of a linear system to a step of any magnitude
can be obtained by merely scaling the unit step response. More
loosely, it may be said that all step responses look alike. In a
nonlinear system, the step responses will not, in general, be
scalar multiples of one another, but it may happen that they
still look alike. Whenever this is true, the effect of the nonlinearities
in the system is of minor importance practically and of limited
interest theoretically. It also may happen, however, that the
step responses will differ in some essential fashion; for instance,
they may be nonoscillatory for a step input of sufficiently small
magnitude and oscillatory for a step input of sufficiently large
magnitude. (See Example 1 of reference 1.)


2 Numbers in parentheses refer to references at end of paper.


Going still further, nonlinear systems may be scrutinized 
according to what happens to the transient solutions as the time 
approaches infinity. Very crudely speaking, this is the notion of 
stability. It may happen that all transient responses tend to a 
unique equilibrium point in the phase space, regardless of initial 
conditions or the magnitude of the input. But it also may 
happen that not all, if any, transient solutions will tend to a 
unique equilibrium point. 

If there are only two energy storage elements, i.e., the system
obeys a second-order nonlinear differential equation, then
classical results due to Poincaré and Bendixson (2, 3) state that
the limiting behavior of unforced solutions (trajectories) of the
differential equations as the time $t \to \infty$ has the following
possible forms:


(a) The trajectories may tend to one or more stable equilib- 
rium points. 

(b) The system may oscillate continually, the trajectories 
tending to a limit cycle in the phase space. 

(c) The trajectories may tend to infinity. 


When the differential equation is of higher than second order, 
results similar to the foregoing are not yet available, but, in the 
author’s opinion, there is little reason to expect surprises (4). 
Much more serious is the restriction that the foregoing results 
apply only to the autonomous behavior of a system; i.e., in the 
absence of an externally applied forcing function or input signal. 
Since most control systems depend very crucially on the inputs, 
this would seem to represent a serious difficulty. If, however, 
the input is of a simple type, such as a combination of a step and 
a ramp, it is usually still possible to carry out an analysis in such 
a fashion that the magnitudes of the steps and ramps become 
parameters of the system and affect its entire dynamic structure. 
This is illustrated in Example 1 and the same procedure is used 
without additional remarks in the other examples (see also refer- 
ences 1, 4). 

Linear systems behave either according to (a) or according to
(c) but not both. Nonlinear systems which behave similarly
will be called monostable. Thus linear systems are always monostable.
A monostable nonlinear system is topologically equivalent
to a linear system; in other words, there exists some
one-to-one bicontinuous transformation which deforms the trajectories
of a monostable nonlinear system into those of a linear
system. An example of a monostable nonlinear system is the
equation $\ddot{x} + a\dot{x} + bx + cx^3 = 0$ ($a, b, c > 0$).

If a nonlinear system is not monostable, all three types of
limiting behavior may be encountered. For instance, van der
Pol's equation $\ddot{x} + k(x^2 - 1)\dot{x} + x = 0$ is not monostable since
it is known that all unforced solutions tend to a stable limit
cycle when $k > 0$.
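The limit-cycle behavior of van der Pol's equation is easily seen numerically. The sketch below is an illustration only: the value $k = 1$, the starting points, the step size, and the integration time are arbitrary assumptions, and a classical Runge-Kutta integration stands in for an analog computer. Trajectories started well inside and well outside the limit cycle settle to oscillations of the same amplitude.

```python
import numpy as np


def van_der_pol_amplitude(x0, v0, k=1.0, dt=0.01, t_end=80.0, tail=10.0):
    """Integrate x'' + k (x^2 - 1) x' + x = 0 and return max |x| over the final `tail` seconds."""

    def acc(x, v):
        return -k * (x * x - 1.0) * v - x

    n = int(round(t_end / dt))
    xs = np.empty(n + 1)
    x, v = x0, v0
    xs[0] = x
    for i in range(n):
        s1x, s1v = v, acc(x, v)
        s2x, s2v = v + 0.5 * dt * s1v, acc(x + 0.5 * dt * s1x, v + 0.5 * dt * s1v)
        s3x, s3v = v + 0.5 * dt * s2v, acc(x + 0.5 * dt * s2x, v + 0.5 * dt * s2v)
        s4x, s4v = v + dt * s3v, acc(x + dt * s3x, v + dt * s3v)
        x += dt * (s1x + 2 * s2x + 2 * s3x + s4x) / 6.0
        v += dt * (s1v + 2 * s2v + 2 * s3v + s4v) / 6.0
        xs[i + 1] = x
    n_tail = int(round(tail / dt))
    return float(np.max(np.abs(xs[-n_tail:])))


amp_inside = van_der_pol_amplitude(0.1, 0.0)    # starts near the unstable origin
amp_outside = van_der_pol_amplitude(4.0, 0.0)   # starts outside the limit cycle
print(amp_inside, amp_outside)
```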

The fact (b) that sustained oscillations may be possible is one 
of the great discoveries of nonlinear mechanics. It is a physical 
phenomenon with which a control-system designer is well ac- 
quainted; he regards it usually as a nuisance, seldom as a friend. 
Case (c), on the other hand, is mathematical fiction indicating 
continual increase in energy storage which cannot take place in 
a physical universe. Realistically speaking, however, the 
existence of Case (c) is a tipoff that the equipment has a pro- 
nounced tendency to burn fuses or go up in flames—hardly to be 
ignored by the engineer. 

The three cases listed are bothersome problems for the nonlinear
control-system designer. His aim is to make the system
behave in a uniform, predictable fashion—as much like a linear 
system as possible. (Sometimes his viewpoint may be different, 
for instance when building an oscillator. These cases will not 
be of interest here.) To wit, he has to make sure that there is 
only one stable equilibrium point, that no sustained oscillations 
are possible, and certainly that there cannot be explosively in- 
creasing energy storage. In other words, from his standpoint 
any limiting behavior other than a unique stable equilibrium 
point may be referred to as an “‘instability.”’ 


2 THE THEME OF THE PAPER


What are the physical and mathematical mechanisms leading 
to some form of instability, which are of interest in the design of 
current-day automatic control equipment? 

Since the equilibrium points are readily calculated from the
differential equation itself, avoiding more than one stable equilibrium
point is a simple matter and does not merit further discussion.
Undoubtedly the most important instabilities of interest
in control systems are limit cycles; the circumstances which
lead to the creation or destruction of limit cycles are to be studied
in considerable detail. Solutions tending to infinity are also
fairly readily avoidable and will receive much less attention.

The discussion will proceed by means of highly idealized, but 
typical, examples. In any realistic situation it is to be expected, 
of course, that none of the phenomena studied will occur in its 
purest form but is likely to interact and be more or less obscured 
in the over-all performance of the system. 

The examples are grouped into several categories which, in 
the author’s belief, represent basically different phenomena. The 
real usefulness of such a classification can of course be decided 
only after much further experience and study. 

Most of the analysis will be carried out by means of the phase- 
plane technique (1 to 5); in particular, heavy use will be made 
of the ideas and terminology of the author’s recent paper (1) 
in which organization of the phase-plane analysis of a wide class 
of systems is simplified and systematized. 


3 GAIN-CONTROLLING NONLINEARITIES


A very commonly encountered situation is the nonlinear control
system shown in Fig. 1. The system consists of a linear transfer
function $G(s)$ which accounts for all the energy storage, and a
nonlinearity $f(e)$ which does not depend on time. It is sometimes
stated that $f(e)$ is a “static” nonlinearity; this terminology is
misleading and confusing because nonlinearities can be specified
independently of any energy storage, provided only that the
over-all system is describable by a differential equation.³


3 Separating nonlinearities from energy storages may lead to
rather complex block diagrams.


Fig. 1


3.1 Sufficient Condition for Stability and Necessary Condition
for Instability

If $f(e)$ is a single-valued, continuous, differentiable function, its
slope $f'(e)$ defines an incremental loop gain. By this is meant the
following: As long as the transient stays in the neighborhood of
$e = e_0$ in the phase space, the nonlinearity may be replaced by a
constant $K = f'(e_0)$, and the behavior of trajectories near $e = e_0$
will be governed by a linear differential equation (with constant
coefficients) whose characteristic roots depend on $K$ and $G(s)$.
This self-evident and seemingly naive notion actually has important
consequences:

(a) If f(e) in Fig. 1 is replaced by constants K corresponding 
to all possible values of f’(e), and it is found that the closed-loop 
system is stable for all such K, then it is intuitively clear that 
the system must be monostable; i.e., all transient solutions will 
converge to a unique, stable critical point. 

(b) If the closed-loop system is unstable for some K and stable 
for other K, i.e., if the roots of the characteristic equation have 
both positive and negative real parts depending on K, then one 
would expect that the system may not be monostable. 

Since K is a factor in the closed-loop gain of the system, it is 
logical to call f(e) a gain-controlling nonlinearity. The usefulness 
of the concept is dependent, of course, on being able to determine 
quickly the closed-loop roots as a function of K. Fortunately, this 
problem has been basically answered by the recently developed 
root-locus method (5 to 7). 

To prove the foregoing statements, one may proceed as fol- 
lows: Suppose that the nonlinearity f(e) is approximated by 
straight-line segments, as in Fig. 2. Each of these segments 
corresponds to a fixed value of K. The approximation may be 
thought to be equivalent to sampling the root locus; there must 
be a sufficiently large number of values of K (i.e., straight-line 
segments) to preserve the essential features of the root locus. 


Fig. 2


To each straight-line segment there corresponds a region in 
phase space within which the transient solutions obey a linear 
differential equation. The characteristic roots of these linear dif- 
ferential equations are determined by the particular slope K of 
the straight-line segment. It is well known that all trajectories 
of an autonomous stable linear differential equation tend to a 
unique critical point in phase space as the time t → ∞. Where
is the critical point of a given region? Suppose the critical points
of the nonlinear system satisfy f(e) = a, where a is a real con-
stant.4 Then the value of e corresponding to the critical point
of each phase-space region is determined by the intersection of 
the (extended) straight-line segments with the line a = const, as 
shown in Fig. 2. Intersections lying on segments of the straight- 
line approximation of f(e) define actual critical points of the 
system (P_1 in the figure). All other points may be called virtual
critical points, since they lie outside the phase-space region to
which they belong and therefore they can never be reached by
their own trajectories. It will be convenient to use brackets to
distinguish virtual critical points [(P_2), (P_3) in Fig. 2] from actual
ones. Further details of this procedure may be found in an earlier 
paper by the author (1). 

Now suppose that the nonlinearity f has been approximated 
with a suitably large number of straight-line segments. Suppose 


4 In general, e will be a linear combination of the phase-space
co-ordinates; additional conditions (e.g., ė = 0, ë = 0, etc.) are
needed to fix the specific value of each of the phase-space co-ordinates 
at the critical point. But this is of no interest here, because the 
effect of the nonlinearity depends only on the value of e at the critical 
point. 




as in assumption (a), that the critical point of each phase-space 
region associated with the segments represents a stable linear 
system, regardless of whether the critical point is actual or vir- 
tual. Take an actual critical point. Find all trajectories be- 
longing to the critical point and extend each trajectory back- 
wards in time until the boundaries of the region are reached. 
If the adjacent region also had an actual critical point [which 
must be stable by assumption (a)], then the trajectories some- 
where along the boundary must point in two different directions. 
This would contradict the fact that the solutions are unique 
when the nonlinearity is continuous. Hence all regions adjacent 
to a region with an actual critical point will have virtual critical 
points because of assumption (a). Continuing the same process, 
it follows that the system can have but one actual critical point. 

To conclude that the system is monostable, it is also necessary 
to show that there are no limit cycles. This follows more easily 
from a physical argument. If there were a limit cycle, then the 
net energy lost in traveling around it in the phase space must be 
zero (cf. also Example 1). But under assumption (a), the system 
is everywhere dissipative and the net energy loss along any curve 
in the phase space is positive. Hence there cannot be any limit 
cycles. 

The arguments are clearly not affected by the number of non-
linearities in the system. They depend crucially, however, on
the continuity of the nonlinearities (cf. also Examples 4 to 6).
The foregoing is summarized by the following:

Theorem. An n-th order ordinary nonlinear differential equa-
tion containing an arbitrary number of single-valued, continuous
nonlinearities is monostable if the incremental linear differential 
equation at each point in the phase space is stable. 

Proposition (b) is simply the converse statement of this theo- 
rem; i.e., the theorem implies the following: 

Corollary. The differential equation just referred to fails to be 
monostable only if the incremental linear differential equation is 
unstable in some region in the phase space. 

Observe that the theorem states a sufficient and the corollary 
a necessary condition. It is curious that this intuitively obvious 
and yet very general and useful result has apparently not been 
proven until now in the engineering literature. 

Monostable systems are of no interest in this paper. Examples 
1 to 3 which follow are nonlinear systems which are not mono- 
stable, but where the nonlinearities are continuous. 

3.2 Examples of Systems With Gain-Controlling Nonlinearities 

Example 1. As the first illustration, consider a control system 
such as shown in Fig. 1, assuming arbitrarily that 


\[ G(s) = \frac{1 - s}{s(1 + s)} \]

\[ f(e) = \text{saturation curve} \tag{1} \]


In physical terms, one might think of the situation as involving 
the control of a system with integration and deadtime, where 
the effectiveness of the controlling variable e(t) is greatly reduced
beyond a certain linear range; one says that e "saturates."
Such a system would have the transfer function G(s) = e^{-2s}/s;
for convenience, however, e^{-2s} is approximated with a lumped-
parameter transfer function5 (1 − s)/(1 + s), which agrees with
the first three terms of the Taylor expansion of e^{-2s} about s =
0. The saturation curve is shown in Fig. 3; a saturation curve
may be specified mathematically by 


\[ f(e) = -f(-e) \]
\[ f'(e) = \text{a nonincreasing positive function of } |e| \]
\[ \max f'(e) = f'(0) \]


5 Actually, simple problems involving deadtime can be treated 
directly in the phase plane. See Eckman (8). 


Fig. 4 (root locus in the s-plane; arrows show increasing |e|, starting from K = f'(0))


A glance at the root-locus curve for the system (Fig. 4) shows 
that if f’(0) is sufficiently large and if f’( | e| ) is nonincreasing, the 
root locus as a function of |e| will be initially in the right half 
plane and later in the left half plane. This suggests that it is 
sufficient to take straight-line segments with only two different 
slopes, as shown in the figure, to approximate f(e). The straight- 
line approximation is denoted by f*(e). If a more detailed study 
of trajectories is called for, then it may be desirable to take as 
many as four different slopes to account for the fact that the 
linear system corresponding to each straight-line segment in the 
approximation of f(e) may have the following types of critical 
points: Unstable node, unstable focus, stable focus, stable 
node (arranged according to decreasing f'(|e|)) (1).
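The sequence of critical-point types can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes the nonlinearity is replaced by a constant gain K, so that 1 + K G(s) = 0 with G(s) = (1 − s)/[s(1 + s)] gives the characteristic equation s^2 + (1 − K)s + K = 0. The function names and the sample values of K are illustrative assumptions.

```python
import cmath

def closed_loop_roots(K):
    """Roots of s^2 + (1 - K) s + K = 0, i.e. 1 + K G(s) = 0 for G(s) = (1 - s)/[s(1 + s)]."""
    b, c = 1.0 - K, K
    disc = cmath.sqrt(b * b - 4.0 * c)
    return (-b + disc) / 2.0, (-b - disc) / 2.0

def is_stable(K):
    """True when both closed-loop roots lie strictly in the left half plane."""
    return all(root.real < 0.0 for root in closed_loop_roots(K))

if __name__ == "__main__":
    # Sample gains chosen (as an assumption) to hit each of the four types.
    for K in (8.0, 2.0, 0.5, 0.1):
        r1, r2 = closed_loop_roots(K)
        kind = "focus" if abs(r1.imag) > 0.0 else "node"
        label = "stable" if is_stable(K) else "unstable"
        print(f"K = {K:4.1f}: roots {r1:.3f}, {r2:.3f} -> {label} {kind}")
```

Under these assumptions the roots cross the imaginary axis at K = 1, which is why two slopes, one above and one below unity, are enough to capture the change of stability.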

From Equation [1], the differential equations of the system are 


\[ r(t) - c(t) = e(t) \]
\[ f^*(e(t)) = y(t) \]
\[ \ddot{c}(t) + \dot{c}(t) = y(t) - \dot{y}(t) \tag{2} \]


The dots denote differentiation with respect to time. 

The problem of considering the effect of inputs on the dynamic 
structure of the system has already been pointed out in Section 1. 
Briefly, the procedure is as follows: The input is to have some 
simple functional form, which depends on a number of param- 
eters. When the input function so specified is substituted in the 
differential equations of the system, the input parameters become 
system parameters. The procedure is successful only if the re- 
sulting differential equations are time invariant. This requires, 
in the present case, that the input consist of combinations of steps 
and ramps only. Using the constant r to denote the magnitude 
of the step component and ṙ the magnitude of the ramp com-
ponent of the input, the class of admissible inputs is given by


\[ r(t) = r + \dot{r}t, \qquad t \ge 0 \tag{3} \]


Noting Equation [3], y and c are eliminated from Equations
[2], bearing in mind that the derivative of f*(e) is discontinuous:

\[ \ddot{e} = -f^*(e) - (1 - K_1)\dot{e} + \dot{r} \quad \text{if } |e| < e_0 \]
\[ \ddot{e} = -f^*(e) - (1 - K_2)\dot{e} + \dot{r} \quad \text{if } |e| > e_0 \tag{4} \]

K_1 and K_2 are defined in Fig. 3. The critical points of Equations
[4] are located at

\[ \dot{e} = 0 \quad \text{and} \quad e = f^{*-1}(\dot{r}) \tag{5} \]

There are several interesting cases arising in connection with
the system [4].

(i) Let K_1 > 1 and K_2 < 1; also let r(t) = 0. Then the root
locus will be in the right half plane for small |e| and in the left
half plane for large |e|. Also, Equations [4] satisfy the con- 
ditions of a well-known theorem of Levinson and Smith (9), and 
it follows at once that there exists a unique, stable limit cycle 
which represents the limiting behavior of all trajectories. 

It is much more instructive, however, to employ the idea of 
actual and virtual critical points mentioned in Section 3.1 to 
explore the stability of the system. There will be always one 
actual and two virtual critical points; the former always un- 
stable, the latter stable. It does not matter much whether the 
critical points are foci or nodes; for purposes of illustration it 
may be assumed that the actual critical point is an unstable 
focus, F_1^-, and that the virtual critical points are stable nodes
(N_2^+), (N_3^+). The resulting trajectories are sketched in Fig.
5. If e > e_0, Region 3, the trajectories will tend to (N_3^+); if
e < −e_0, Region 2, they will tend to (N_2^+). (N_2^+) is in Region 3
and (N_3^+) is in Region 2. It follows from the geometry that all
trajectories will ultimately enter Region 1, |e| < e_0. Since the
critical point in this region is an unstable focus F_1^-, the trajec-
tories cannot remain in Region 1 but ultimately must leave it.
Thus a situation has arisen where the trajectories cannot end up 
at a critical point since the only actual critical point is unstable; 
moreover, they cannot tend to infinity since the virtual critical 
points are stable. The only possible limiting behavior for the 
trajectories then is to tend to a stable limit cycle. 
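The conclusion can be illustrated by direct numerical integration. The sketch below is not from the original paper. It assumes the error equation has the form ë = −f*(e) − (1 − K)ė + ṙ, with K the local slope of f*, and it uses illustrative values K_1 = 2, K_2 = 0.5, e_0 = 1 that are assumptions chosen only to satisfy K_1 > 1 and K_2 < 1. A fixed-step Runge-Kutta scheme stands in for the graphical phase-plane construction.

```python
K1, K2, E0 = 2.0, 0.5, 1.0   # illustrative values only (assumed): K1 > 1, K2 < 1

def f_star(e):
    """Two-slope straight-line approximation f*(e) of an odd saturation curve."""
    if abs(e) <= E0:
        return K1 * e
    return (1.0 if e > 0 else -1.0) * (K1 * E0 + K2 * (abs(e) - E0))

def rhs(e, v, r_dot=0.0):
    """e' = v, v' = -f*(e) - (1 - K) v + r_dot, with K the local slope of f*."""
    K = K1 if abs(e) < E0 else K2
    return v, -f_star(e) - (1.0 - K) * v + r_dot

def rk4_step(e, v, dt, r_dot=0.0):
    """One classical fourth-order Runge-Kutta step."""
    a1, b1 = rhs(e, v, r_dot)
    a2, b2 = rhs(e + 0.5 * dt * a1, v + 0.5 * dt * b1, r_dot)
    a3, b3 = rhs(e + 0.5 * dt * a2, v + 0.5 * dt * b2, r_dot)
    a4, b4 = rhs(e + dt * a3, v + dt * b3, r_dot)
    return (e + dt * (a1 + 2 * a2 + 2 * a3 + a4) / 6.0,
            v + dt * (b1 + 2 * b2 + 2 * b3 + b4) / 6.0)

def late_amplitude(e_init, t_end=200.0, dt=0.01, window=30.0):
    """Largest |e| observed during the final `window` time units of the run."""
    e, v, peak = e_init, 0.0, 0.0
    steps = int(round(t_end / dt))
    first_recorded = steps - int(round(window / dt))
    for i in range(steps):
        e, v = rk4_step(e, v, dt)
        if i >= first_recorded:
            peak = max(peak, abs(e))
    return peak

if __name__ == "__main__":
    print("start near the unstable focus:", late_amplitude(0.01))
    print("start far outside:            ", late_amplitude(20.0))
```

A trajectory started close to the origin and one started far away should settle to nearly the same amplitude, which is what a single stable limit cycle enclosing the unstable critical point would produce.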

The argument used here may be sharpened by a device due 
originally to Poincaré: Since the system is stable for large |e|,
there exists a very large closed curve in the phase plane such that 
all trajectories cross it toward the inside. Moreover, since the 
actual critical point of the system is unstable, there exists a very 
small closed curve in the neighborhood of the critical point, con- 
taining the critical point, such that all trajectories cross it 
toward the outside. This implies that there exists at least one 
stable limit cycle in the annular region between the two closed 
curves.6 The two closed curves in question are also shown in 
Fig. 5; the large one is diamond-shaped, the small one is a circle. 
By examining the limiting behavior of the trajectories, it is ap- 
parent that the limit cycle (which is not sketched, to avoid con- 
gestion) has a shape roughly midway between that of the two 
closed curves. Unfortunately, the construction of such closed 
curves is a delicate analytical task and therefore definitely not 
recommended as a practical procedure. The concept of actual 
and virtual critical points, however, is very easy to handle and it 
can be extended readily to systems of higher than second order 
(4) although this cannot be discussed here in detail. See, how- 
ever, Example 2, which follows. 

Notice that the limit cycle lies in all three regions. This sug- 
gests a simple physical explanation of instability. The limit 
cycle exists if and only if the energy gained during the time the 
trajectory was in Region 1 (where the damping is negative) is 


6 Reference (2), p. 245.


Fig. 3

Fig. 5

Fig. 6 (K_1, K_3 < 1; K_2 > 1)


precisely equal to the energy lost while the trajectory was in 
Regions 2 and 3 (where the damping is positive). This is as 
far as it is possible to go with intuitive physical explanations. 
To show, for instance, that the limit cycle is actually stable (or 
that it can exist at all—not a trivial problem as will be seen in 
Case ii), it is necessary to use arguments which are explicitly 
or implicitly the analog of the phase-plane picture Fig. 5. 

(ii) The question arises as to what the effect of a nonzero 
input is. If the input is a step of any magnitude, then there is no 
effect whatever since the constant r simply does not appear in 
Equations [4]. This is because the control system discussed has 
zero steady-state error in response to step inputs (1, 4). 

If the input is allowed to be a ramp, the situation will change 
because ṙ appears as a parameter in Equations [4]. Now if the
actual critical point as given by Equation [5] is still located at
|e| < e_0, all the discussion in connection with Case (i) remains
valid, although the phase-plane picture will become distorted.
If the actual critical point is at |e| > e_0, the situation changes
abruptly, because now the unstable actual critical point has be-
come stable. The same "exchange of stabilities" will take place
in regard to the virtual critical points.7 If there exists a limit
cycle it must enclose an actual critical point. If the critical point
is stable, the limit cycle enclosing it must be necessarily unstable.
Since there is no intuitive reason to expect an unstable limit cycle
to occur, it seems that increasing |ṙ| destroys the stable limit
cycle. The rigorous analysis is as follows: 

As ṙ increases from 0, all critical points move to the right but the
linear regions remain unchanged. As f*^{-1}(ṙ) approaches the up-
per breakpoint e_0 in the approximated saturation curve shown in
Fig. 3, the distance between (N_3^+) and F_1^- decreases; in fact,
(N_3^+) moves from Region 2 into Region 1. By examining the
trajectories belonging to Regions 1 and 3, it is easily seen that as
(N_3^+) and F_1^- approach one another, the stable limit cycle be-
comes smaller and smaller until it coincides with the actual
critical point when ṙ = f*(e_0). If we let ṙ > f*(e_0), the limit cycle
disappears altogether and the critical points become (F_1^-), N_3^+,
and (N_2^+), all being located in this order on the e-axis in Region 3.
Thus the unstable actual critical point and the stable limit cycle
coalesce and give rise to a stable critical point as |ṙ| increases
from zero. After Poincaré, phenomena of this type are called 
bifurcation (2, p. 73). 
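The bookkeeping of actual and virtual critical points in this case can be sketched as follows. This is an illustration only: it assumes a two-slope saturation f*(e) with slopes K_1 and K_2 and breakpoints at ±e_0, and it locates the critical point of each region by solving f*(e) = ṙ on the extended straight-line segment of that region. The numerical values K_1 = 2, K_2 = 0.5, e_0 = 1 and the function name are assumptions.

```python
def critical_points(r_dot, K1=2.0, K2=0.5, e0=1.0):
    """Return {region: (e, is_actual)} for a two-slope saturation curve.

    Each region's critical point solves (extended segment) = r_dot; it is
    'actual' only if it falls inside the region it belongs to.
    """
    e1 = r_dot / K1                          # Region 1: K1*e = r_dot
    e2 = -e0 + (r_dot + K1 * e0) / K2        # Region 2: -K1*e0 + K2*(e + e0) = r_dot
    e3 = e0 + (r_dot - K1 * e0) / K2         # Region 3:  K1*e0 + K2*(e - e0) = r_dot
    return {
        1: (e1, abs(e1) < e0),
        2: (e2, e2 < -e0),
        3: (e3, e3 > e0),
    }

if __name__ == "__main__":
    for r_dot in (0.0, 1.0, 3.0):
        print(r_dot, critical_points(r_dot))
```

With these assumed values, ṙ = 0 gives one actual point at the origin and two virtual points, each lying in the opposite outer region, while ṙ = 3 (larger than f*(e_0) = 2) makes the Region 3 point actual and leaves all three points in Region 3 in the order described in the text.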

The dependence of stability of a nonlinear system on a param- 
eter and, in particular, the creation and destruction of limit 
cycles is an important, profound, and relatively little explored 
question of the theory of nonlinear differential equations. A 
brilliant recent study by Duff (10) suggests that useful results 
can be obtained by quite simple means. In fact, the foregoing 
analysis is motivated by Duff’s work, although his technique is 
not directly applicable here. 

(iii) To explore further the problems raised in the last case, 


7 If f(e) rather than its approximation f*(e) is used in Equation
[5], the transition is not abrupt; otherwise the situation is the same. 


Fig. 7


Fig. 8


the nonlinearity is replaced by that shown in Fig. 6 where a 
straight-line approximation has been made already. The dif- 
ference is that now the root locus begins in the left half plane, 
moves into the right-half plane, and finally returns to the left 
half plane as |e| increases from zero, since f’(|e| ) is not mono- 
tonic. Three different values of f’(|e| ) are sufficient to sample 
the root locus (as was assumed in drawing Fig. 6), but if only two 
values had been taken, some aspect of stability would have been 
lost. 

Assume again r(t) = 0. The origin is a stable actual critical 
point and there are stable virtual critical points corresponding 
to K_3. But there are also unstable virtual critical points cor-
responding to K_2. Hence one may suspect that possibly not all
trajectories will reach the origin. Should this be correct, it is 
possible to construct three nested closed curves as sketched in 
Fig. 7. If such a set of closed curves exists, then all trajectories 
leave Region 1 and tend to a stable limit cycle in Region 2. All 
trajectories entering Region 4 end up at the stable critical point
at the origin. There exists an unstable limit cycle in Region 3 
such that trajectories on its outside tend to the stable limit 
cycle and on its inside to the critical point at the origin. The 
existence of the largest and smallest such closed curves may be 
confidently assumed by virtue of arguments used in Case (i). 
But there are no a priori grounds for assuming the existence of a 
closed curve in the middle. This is strictly a quantitative ques-
tion. If K_2 > 1 and e_1/e_0 > 1 (see Fig. 6), then the existence of
such a closed curve appears plausible on physical grounds. As
either one of the parameters K_2 or e_1/e_0 is decreased, a situation
similar to Case (ii) may be expected to occur. The two limit
cycles approach one another and at some stage, when K_2 > 1 and
e_1/e_0 > 1 still hold (i.e., when there is still a pair of unstable virtual
critical points), the limit cycles coincide. If the parameters are 
further decreased, the limit cycles disappear and, a fortiori, it 
will not be possible to construct a closed curve such that all tra- 
jectories cross it toward the outside. 

The discussion of the effect of ramp input is quite similar to 
that of Case (ii) and may be omitted. 

Example 1 has shown how the stability of a system may be in- 
fluenced by the nonlinearity, keeping the transfer function un- 
changed. The converse situation is also interesting. 

Example 2. The system is again as shown in Fig. 1. Let f(e) 
be a saturation-type nonlinearity, Fig. 3. Consider two different 
open-loop transfer functions: 

(i) G(s) = A/s(s + b)(s + c) (A, b, c = positive constants).
The corresponding root locus is sketched in Fig. 8. If A is suffi- 
ciently large, the root locus will start in the right half plane and 
move monotonically into the left half plane with increasing | e| as 
indicated in the figure. Arguments similar to those used in 
connection with Case (i), Example 1, show that there exists a 
unique stable limit cycle surrounding an unstable critical point in 
the phase space. By making A sufficiently small, the critical 
point becomes stable and the limit cycle is destroyed. 
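The dependence on A can be made explicit with the Routh-Hurwitz test. The sketch below is an illustration, not the author's procedure: it assumes the nonlinearity is replaced by a constant incremental gain K, so that the characteristic equation is s^3 + (b + c)s^2 + bc s + KA = 0, which is stable for 0 < KA < bc(b + c). The function names and sample numbers are assumptions.

```python
def critical_slope(A, b, c):
    """Largest incremental gain K for which s^3 + (b+c)s^2 + bc*s + K*A = 0 is stable."""
    return b * c * (b + c) / A

def loop_is_stable(K, A, b, c):
    """Routh-Hurwitz condition for the cubic above (A, b, c assumed positive)."""
    return 0.0 < K * A < b * c * (b + c)

if __name__ == "__main__":
    A, b, c = 12.0, 1.0, 2.0          # illustrative constants (assumed)
    print("critical slope:", critical_slope(A, b, c))
    for K in (0.4, 0.6):
        print(K, loop_is_stable(K, A, b, c))
```

Under this reading, the critical point is unstable, and a limit cycle is to be expected, when the slope of the saturation curve at the critical point exceeds the critical slope; reducing A raises the critical slope until the critical point becomes stable.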




(ii) G(s) = A(s + d)(s + g)/s(s + b)(s + c) (A, b, c, d, g =
positive constants, 0 < c < d < g). In this case the root
locus is more complicated and, provided c < d, it will have the
qualitative shape sketched in Fig. 9. 

When A is sufficiently large, the root locus will start in the left 
half plane, enter the right half plane, and then return to the 
left half plane as |e| increases. The assumption c < d guaran-
tees that the root locus will remain in the right half plane for


Fig. 9 (root locus in the s-plane; |e| increasing from K = f'(0))


a large range of |e|; arguments similar to those used in Case (iii),
Example 1, show that there exists an unstable and a stable limit
cycle in the phase space in addition to a stable critical point.

If the condition c < d is not satisfied with a sufficient margin,
then the root locus will not remain in the right half plane long 
enough and no limit cycle will exist. In other words, as the zero 
at —d, originally deeply in the left half plane, moves along the 
real axis toward the origin, the limit cycles in the phase space 
will move closer and closer together until they coincide and there- 
after vanish. It should be noted also that Kochenburger’s ap- 
proximate stability criterion (11) implies the existence of a stable 
and an unstable limit cycle as long as the root locus crosses the 
imaginary axis twice. In reality, the preceding discussion shows 
that the limit cycles will vanish before the root locus leaves the 
right half plane entirely. The exact sufficient conditions for the 
existence of limit cycles would be extremely difficult to find and 
should best be left to computer studies in particular cases. 

Finally, as A decreases, it may happen that the root locus starts 
in the right-half plane. Then the situation is the same as Case 
(i) discussed previously. 

Example 3. In a recent paper, Lewis (12) suggests improving
the performance of a conventional second-order servo by making 
the damping ratio small or even negative when the error is large. 
The equations of such a system in the (e, €) phase plane are 

\[ \frac{de}{dt} = \dot{e}; \qquad \frac{d\dot{e}}{dt} = -\left[ 2(b - a|e|)\dot{e} + e \right] \quad \text{(step inputs only)} \tag{6} \]

Inspection of Equation [6] shows that trajectories which re-
main in the region b − a|e| > 0 tend to the unique critical
point at the origin of the phase plane; but if a trajectory enters
the region b − a|e| < 0 it may diverge to infinity, i.e., there may
be "explosive" instability. This has been confirmed by means of
an analog-computer study (13). Indeed, by detailed analysis of 
Equation [6], which is too complicated to be given here, it is 
possible to establish the existence of a unique, unstable limit 
cycle. 
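The two kinds of behavior are easy to reproduce numerically. The following sketch is offered as an illustration only; it assumes Equations [6] read de/dt = ė, dė/dt = −[2(b − a|e|)ė + e], and the values a = b = 0.1, the starting points, and the blow-up threshold are arbitrary assumptions. It integrates from (e, ė) = (e_init, 0) and reports whether the trajectory settles at the origin or exceeds the threshold.

```python
def simulate(e_init, a=0.1, b=0.1, t_end=200.0, dt=0.01, blow_up=1.0e3):
    """Integrate e'' = -[2(b - a|e|) e' + e] with RK4 from (e_init, 0).

    Returns 'diverged' if |e| or |e'| exceeds `blow_up`, 'converged' if the
    state ends very close to the origin, and 'undecided' otherwise.
    """
    def rhs(e, v):
        return v, -(2.0 * (b - a * abs(e)) * v + e)

    e, v = e_init, 0.0
    for _ in range(int(round(t_end / dt))):
        a1, b1 = rhs(e, v)
        a2, b2 = rhs(e + 0.5 * dt * a1, v + 0.5 * dt * b1)
        a3, b3 = rhs(e + 0.5 * dt * a2, v + 0.5 * dt * b2)
        a4, b4 = rhs(e + dt * a3, v + dt * b3)
        e += dt * (a1 + 2 * a2 + 2 * a3 + a4) / 6.0
        v += dt * (b1 + 2 * b2 + 2 * b3 + b4) / 6.0
        if abs(e) > blow_up or abs(v) > blow_up:
            return "diverged"
    return "converged" if e * e + v * v < 1.0e-6 else "undecided"

if __name__ == "__main__":
    print("small initial error:", simulate(1.0))
    print("large initial error:", simulate(5.0))
```

The boundary between the two outcomes is the unstable limit cycle mentioned in the text; the sketch does not attempt to locate it.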

These illustrations show the usefulness of the root-locus con- 
cept in obtaining a quick feeling for possible instabilities in a 
control system. Recall, however, that the nonlinearities must 
be continuous. A much richer storehouse of phenomena is en- 
countered in the next sections where discontinuous nonlinearities 
of various types are studied. The systems considered in Sections 




4 and 5 could be converted to systems containing continuous 
nonlinearities, though only at the expense of working in a higher- 
dimensional phase space. In the cases discussed in Sections 6 
and 7, the discontinuity is in a sense inherent in the problem and 
cannot be eliminated. 


4 Systems Whose Governing Equations Change Discontinuously

In the preceding examples, the same differential equation was
valid over the entire phase space, so that when the nonlinearities
are approximated with a sufficiently large number of straight-line
segments, the properties of the corresponding linear systems in
adjacent regions of the phase space change in a continuous man-
ner. One way of relaxing the continuity is by letting the system
be governed by two entirely different linear differential equations
in adjacent, nonoverlapping regions of the phase space. Probably
the simplest example is provided by coulomb ("dry") friction.

Example 4. Consider a simple positional servomechanism, 
where the output element, in addition to inertia, is subject to 
combined coulomb and viscous friction. The torque T to the
output element is supplied by an ideal source which possesses no 
dynamics. The system is shown in Fig. 10. The nonlinear- 
friction characteristic, denoted by f, is shown in Fig. 11. If the 




output element is not moving, torque T_0 is necessary to overcome
static coulomb friction and start the motion. Once the output
element is moving in either direction the coulomb friction drops
to its dynamic value T_1 < T_0 which then remains constant for all
|ċ| > 0; to this is added the viscous friction which may be assumed
to vary linearly with the velocity ċ. Actually, Fig. 11 is a highly
idealized picture of a complicated physical situation, about which 
it is difficult to obtain reliable data. The discontinuity could be 
lessened by introducing negative slopes as shown in Fig. 11 with 
dashed lines; this, however, would not affect materially the 
analysis which follows and is therefore omitted. 

The differential equations of the motion in the (c, ċ) phase
plane may be written down by inspection, assuming that the in-
put is given by Equation [3]


\[ \frac{dc}{dt} = \dot{c} \quad \text{and} \quad \frac{d\dot{c}}{dt} = K(r + \dot{r}t - c) - f(\dot{c}) \tag{7} \]


Fig. 10 (block labels: torque source and amplifier)

Fig. 11 (friction characteristic f(ċ) with levels T_0 and T_1)


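The simplicity of these equations is treacherous; indeed, Equa-
tions [7] only hold when

\[ \dot{c} \ne 0 \quad \text{or} \quad \dot{c} = 0 \text{ and } K|r + \dot{r}t - c| > T_0 \tag{8} \]

When K|r + ṙt − c| < T_0 no motion can take place because the
torque is insufficient to overcome static coulomb friction. The
situation becomes easier to visualize when the problem is re-
stated in the (e, ċ) phase plane, where the system equations are

\[ \frac{de}{dt} = \dot{r} - \dot{c} \quad \text{and} \quad \frac{d\dot{c}}{dt} = Ke - f(\dot{c}) \quad \text{when } \dot{c} \ne 0, \text{ or } \dot{c} = 0 \text{ and } K|e| > T_0 \tag{9a} \]

\[ \frac{de}{dt} = \dot{r} \quad \text{and} \quad \frac{d\dot{c}}{dt} = 0 \quad \text{when } \dot{c} = 0 \text{ and } K|e| < T_0 \tag{9b} \]

The critical points of Equations [9a] are at

\[ \dot{c} = \dot{r} \quad \text{and} \quad e = \frac{1}{K} f(\dot{r}) \tag{10} \]

Eliminating ṙ from Equation [10], it is found that

\[ e = \frac{1}{K} f(\dot{c}) \]

at the critical points; i.e., the critical points lie on the inverse
friction curve in the (e, ċ) phase plane. Equations [9b] have
no critical point.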

The critical points given by Equation [10] depend on the
parameter ṙ. There are two cases to be considered.

(i) ṙ = 0. In the upper half of the phase plane, ċ > 0, the
trajectories obey a linear differential equation and belong to a
virtual critical point located at e = T_1/K. By symmetry, the
trajectories in the lower-half plane ċ < 0 belong to a virtual
critical point located at e = −T_1/K. If the gain K is sufficiently
large, the critical points will be stable foci; if K is sufficiently
small, they will be stable nodes. If the critical points are foci,
the trajectories spiral around the origin until they intersect the
line segment ċ = 0 and |e| < T_0/K, at which instant the motion
stops, because the rate of change of both e and ċ is then zero,
according to Equation [9b]. In a sense, the entire line segment
is a locus of critical points in this case. Physically, the situation
is described by the phrase, "The system sticks," i.e., the torque
generated by the system is insufficient to overcome static coulomb
friction after a certain time. A sketch of a typical trajectory is
shown in Fig. 12. Only accidentally can a trajectory reach one
of the critical points; hence they are to be regarded as virtual in the
sense of Section 3.1. Substantially the same result holds when
the critical points are stable nodes.
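The statement about foci and nodes can be made quantitative under a simple friction model. The sketch below is an assumption-laden illustration, not the author's formula: it takes unit inertia, writes the sliding friction for ċ > 0 as f(ċ) = T_1 + Bċ (the viscous coefficient B is a name introduced here), and assumes the sliding equations de/dt = ṙ − ċ, dċ/dt = Ke − f(ċ). Deviations from the critical point then obey ẍ + Bẋ + Kx = 0, so the point is a focus when B^2 < 4K and a node otherwise.

```python
def critical_point(r_dot, K, T1, B):
    """e-coordinate of the critical point for c_dot > 0: e = f(r_dot)/K, f = T1 + B*c_dot."""
    return (T1 + B * r_dot) / K

def classify(K, B):
    """Type of critical point from the roots of s^2 + B s + K = 0 (B, K > 0 assumed)."""
    return "stable focus" if B * B < 4.0 * K else "stable node"

if __name__ == "__main__":
    T1, B = 1.0, 1.0                  # illustrative values (assumed)
    for K in (0.1, 4.0):
        print(f"K = {K}: {classify(K, B)}, e* = {critical_point(0.0, K, T1, B)}")
```

With ṙ = 0 the critical point sits at e = T_1/K, in agreement with the text, and raising K at fixed B turns the node into a focus.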

(ii) ṙ ≠ 0. This is a much more interesting case. Here
de/dt = ṙ ≠ 0 when the trajectory is on the line segment corre-
sponding to static coulomb friction, so that the trajectory cannot
stay there but must leave it at e = ±T_0/K. One of the critical
points is always actual, the other virtual. If the critical points
are stable nodes, the trajectory tends monotonically to N_1^+
once it has left the static-friction segment and the system is
stable for all ṙ. But if the critical points are foci and ṙ is suffi-
ciently small, the situation shown in Fig. 13 may arise. In
other words, the trajectory leaving the e-axis at e = T_0/K and
spiraling around F_1^+ may reintersect the e-axis at |e| < T_0/K
and a limit cycle results. Physically speaking, the system alter-
nately sticks and moves and the resulting oscillation is decidedly
nonsinusoidal in appearance. The limit cycle may be destroyed




by increasing the magnitude of ṙ, so that the trajectory leaving
the e-axis at T_0/K and spiraling around the critical point cannot
reintersect the e-axis. This is shown by the dashed trajectory
in Fig. 13 which arises when the magnitude of the ramp is ṙ'.
Alternately, for any fixed ṙ, the limit cycle also can be destroyed
by letting the ratio T_0/T_1 approach unity. Thus the instability
in such a system (i.e., the limit cycle) results from the interac- 
tion of several different elements in the system: (a) The dis- 
continuous friction characteristic; in particular, the fact that 
static coulomb friction exceeds dynamic coulomb friction; 
(b) the input must be of particular type, a small ramp in this 


Fig. 13


case; (c) the gain must be such that the actual critical point is a 
focus. 

Further details of the problem are given by Lauer (14), who 
discusses it from a somewhat different point of view. 

The oscillations encountered here are frequently called "re-
laxation oscillations," by which is meant that arcs of the limit cy-
cle are contained in regions of the phase space where the motion
is subject to different physical laws. It is significant that the
instability can only be brought about by a special type of input, 
and is absent when the system is subject to other inputs. Extrap- 
olating this observation suggests that many control systems 
with sufficiently strong nonlinearities may exhibit instabilities 
when subjected to a small, special class of inputs, which perhaps, 
are present only with a vanishingly small probability in the class 
of all inputs for which the system is designed. For instance, cer- 
tain control systems behave in an unsatisfactory fashion when the 
input is periodic (or a random signal with a very strong periodic 
component), while they perform quite well when the inputs are 
of a less special kind. Unfortunately, no serious study of such 
questions exists at this time. 


5 Caused by Multivalued Nonlinearities


Interesting nonlinear effects arise when the trajectories starting 
at any point in the phase plane are not necessarily unique, but 
may depend on the "state" of the nonlinearity. Possibly the
simplest example of such effects is afforded by backlash (dead- 
zone in gear trains). 

Example 5. Consider another simple positional servomecha- 


Fig. 12


nism, shown in Fig. 14, similar to that discussed in connection 
with Example 4. The difference is that now the output element is 
purely inertial (with negligible friction); an ideal lead network 
is used to stabilize the system. The nonlinearity arises because, 
to obtain the required feedback, a gear train must be used which 
is (as all conventional gear trains are) subject to backlash. The 
backlash characteristic is idealized as shown in Fig. 15. It is 
subject to the following interpretation: If ċ > 0, i.e., the gears
move in the positive direction, segment A of the characteristic
applies. If now ċ decreases and goes through zero, then f(c) will
stop at the value reached when the velocity was just zero (segment
B), until c has traveled the small distance 2Δ in the negative direc-
tion, at which time the gears again become enmeshed and f(c)
starts decreasing along segment C. By symmetry, the same
process occurs when ċ starts to increase, going through zero. The


Fig. 14 (block labels: torque source, gear train)


Fig. 15


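slopes of segments A and C may be taken as unity. The differ-
ential equations of the system in the (c, ċ) phase plane are, assum-
ing r(t) = 0

\[ \frac{dc}{dt} = \dot{c} \quad \text{and} \quad \frac{d\dot{c}}{dt} = -K_2(K_1\dot{c} + c - \Delta) \tag{11a} \]

provided ċ > 0 and gears are enmeshed (segment A)

\[ \frac{dc}{dt} = \dot{c} \quad \text{and} \quad \frac{d\dot{c}}{dt} = -K_2\Gamma^+ \tag{11b} \]

provided ċ < 0 and c_0^+ > c > c_0^+ − 2Δ, and gears are not en-
meshed (segment B)

\[ \frac{dc}{dt} = \dot{c} \quad \text{and} \quad \frac{d\dot{c}}{dt} = -K_2(K_1\dot{c} + c + \Delta) \tag{11c} \]

provided ċ < 0 and gears are enmeshed (segment C)

\[ \frac{dc}{dt} = \dot{c} \quad \text{and} \quad \frac{d\dot{c}}{dt} = -K_2\Gamma^- \tag{11d} \]

provided ċ > 0 and c_0^- < c < c_0^- + 2Δ, and gears are not en-
meshed (segment D).

Terms c_0^+, c_0^-, Γ^+, Γ^-, Δ are defined in Fig. 15.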




The regions of applicability of Equations [11a] and [11d] over-
lap; the same holds for Equations [11b] and [11c]. To simplify
the discussion, assume arbitrarily that the gears are initially en- 
meshed and will continue to be for a small subsequent period of 
time. This amounts to specifying the initial state of the non- 
linearity. 
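The role of the state of the nonlinearity can be made concrete with a small stateful model of the backlash characteristic. This is a sketch under stated assumptions, not the author's formulation: the output follows c − Δ while the gears drive in the positive direction (segment A), follows c + Δ while they drive in the negative direction (segment C), and holds its last value while c crosses the gap of width 2Δ (segments B and D). The class name, the clamping form of the update, and the sample values are illustrative.

```python
class Backlash:
    """Idealized backlash of half-width delta with unit slope on the driving segments."""

    def __init__(self, delta, c0=0.0):
        self.delta = delta
        # Assumed initial state: gears enmeshed on segment A, so output = c0 - delta.
        self.y = c0 - delta

    def update(self, c):
        """Return the output for input position c, remembering the previous output."""
        # The output can differ from c by at most delta; inside the gap it is held.
        self.y = min(max(self.y, c - self.delta), c + self.delta)
        return self.y

if __name__ == "__main__":
    gear = Backlash(delta=0.5)
    for c in (1.0, 0.6, 0.0, -1.0):
        print(c, gear.update(c))
```

The held value in the gap plays the part of the constant forcing term on segments B and D, and the stored attribute is exactly the extra piece of state that makes trajectories through a given phase-plane point non-unique.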

(i) r(t) = 0. There are two linear families of trajectories cor-
responding to the segments A and C in Fig. 15. For any fixed
K_2, the critical points will first be stable foci and then stable
nodes as K_1 increases. Let the critical points be nodes; they are
located at ċ = 0 and c = ±Δ.

What happens to a trajectory tending to one of the nodes, say
(N_A^+), when it intersects the c-axis (see Fig. 16)? Assume the
intersection occurs at c > Δ. The subsequent trajectory will
obey [11b], until c has decreased by the amount 2Δ, at which
time [11c] becomes applicable (segment C in Fig. 15). The
family of trajectories obeying [11b] is not linear because the forc-
ing function Γ^+ depends on the point where the trajectory
intersects the c-axis. The locus of all points on trajectories of
the family [11b] whose c-co-ordinate is at a distance 2Δ from the


point where the trajectory crosses the c-axis, is indicated by the 
heavy curve B in the phase plane. This curve may be shown 
to consist of parabolic segments. 

It is now a simple matter to sketch representative trajectories.
If the trajectory starts in the upper [lower] half plane, it will
tend to N_A⁺) [(N_C⁺] until it arrives at the critical point or until
it intersects the c-axis from above [below]. The trajectory is
then governed by [11b] (or [11d]) until it intersects the boundary
curve B, after which it will tend to (N_C⁺ [N_A⁺)] and so on.

Since the regions of validity of [11a] to [11d] overlap, more than
one trajectory may pass through the same point in the phase
plane. This complication may be removed by using two phase
planes: (a) a plane corresponding to the state, “gears not
enmeshed,” in which the trajectories obey [11b] and [11d]; (b) a
plane corresponding to the state, “gears enmeshed,” in which the
trajectories obey [11a] and [11c]. The two planes are to be
joined along the c-axis and along the curve B, but otherwise
have no points in common. Hence Fig. 16 may be recognized as
the projection of the trajectories of plane (a) on plane (b), subject
to the assumption that all trajectories start initially in plane
(b). The scheme just described is really a special case of a
three-dimensional phase space, where the third dimension has been
quantized so that only two values, corresponding to the states
of the nonlinearity, are of interest. Mathematically, the
situation is entirely analogous to the use of Riemann surfaces for
representing multivalued functions.

Fig. 16

Finally, from the geometry of the trajectories belonging to the
nodes, it is clear that the trajectories cannot intersect the c-axis
at |c| < A. What is the behavior of trajectories near the critical
points? There are two straight-line trajectories tending to each
node, such as F and S tending to N_A⁺). These are so-called
eigenvectors of the linear system (4) associated with the node.
All trajectories which ultimately arrive at N_A⁺) must approach
it in the sector bounded by the c-axis and F and containing S;
all such trajectories tend asymptotically to S. On the other
hand, if trajectories in the upper half plane originate outside of
this sector, ultimately they will intersect the c-axis, move into
the lower half plane, etc. Hence N_A⁺) appears as an actual
critical point for all trajectories contained within the former sector,
and it appears as a virtual critical point for trajectories not
contained in that sector. Therefore the nodes may be called
semi-actual or semi-virtual; this is the reason for the notation N_A⁺),
(N_C⁺.
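
The straight-line trajectories F and S can be illustrated numerically for a generic stable node. The sketch below assumes a linear second-order system of the form c̈ + a·ċ + b·(c − c_crit) = 0; the coefficients `damping` (a) and `stiffness` (b) and the function names are hypothetical and are not taken from the paper's equations. Each straight-line trajectory has the form ċ = s·(c − c_crit), where s is a characteristic root.

```python
import math


def node_eigen_slopes(damping, stiffness):
    """Slopes of the two straight-line trajectories of a stable node.

    Assumed model (illustrative only): c'' + damping*c' + stiffness*(c - c_crit) = 0.
    Returns (s_fast, s_slow) with s_fast < s_slow < 0.
    """
    disc = damping * damping - 4.0 * stiffness
    if damping <= 0.0 or stiffness <= 0.0 or disc <= 0.0:
        raise ValueError("critical point is not a stable node with two distinct roots")
    root = math.sqrt(disc)
    s_fast = (-damping - root) / 2.0  # line F: smaller root in algebraic value
    s_slow = (-damping + root) / 2.0  # line S: trajectories become tangent to it
    return s_fast, s_slow


def in_capture_sector(c, c_dot, c_crit, s_fast):
    """True if an upper-half-plane point lies between the c-axis and line F.

    Such a point lies to the left of the node (c < c_crit) and below F.
    """
    return c < c_crit and 0.0 < c_dot < s_fast * (c - c_crit)


if __name__ == "__main__":
    s_fast, s_slow = node_eigen_slopes(damping=3.0, stiffness=2.0)
    print(s_fast, s_slow)
    print(in_capture_sector(0.0, 1.0, c_crit=1.0, s_fast=s_fast))
    print(in_capture_sector(0.0, 3.0, c_crit=1.0, s_fast=s_fast))
```

Points inside the sector reach the node directly; points above F overshoot and cross the c-axis, which is where the backlash segments take over in the text.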

Notice also that it is possible to have a stable limit cycle
L_A L_B L_C L_D, as shown in Fig. 16. All trajectories which intersect
the c-axis in the interval A < |c| < c_L, where c_L denotes the
intersection of the limit cycle with the positive c-axis, will tend to the
limit cycle. Further, if a trajectory is at or near a critical point,
a small perturbation is sufficient to carry it to the limit cycle.
Hence, practically speaking, all trajectories will ultimately lead
to the limit cycle.

From Fig. 16 it is seen that a necessary condition for the
existence of a limit cycle is that it intersect the curve B at a point β
which is closer to the origin than the intersection α of the line
F with B. By a much more complicated analysis it is possible
to show that this condition is also sufficient for the existence of the
limit cycle (17), which is stable. Since from [11a] the equation
of the line F is (4)

$$\dot c = s_1 (c - A) \qquad [12]$$

where s₁ is the smaller (in algebraic value) characteristic root of
[11a], and since s₁ → −∞ as K₁ → ∞, it is not possible to avoid
the situation shown in Fig. 16 by any choice of K₁, although the
amplitude of the limit cycle can be somewhat decreased by
increasing K₁. To destroy the limit cycle altogether, it is
necessary to prevent F from intersecting B, which can be done only by
modifying B. One possible solution lies in providing damping
for the output element of the servo (17).

Analysis of the case where the critical points of [11a] and
[11c] are foci (17, 18) is similar. Then both critical points are
virtual; a stable limit cycle always exists.

Owing to the double-valued nonlinearity, the phase-plane
structure shown in Fig. 16 is quite special. For instance, it violates
the well-known requirement, valid when the nonlinearities are
single-valued as in Section 3, that a stable limit cycle must
enclose an unstable limit cycle or an unstable critical point: the
stable limit cycle in Fig. 16 encloses two semi-actual critical
points. This irregularity of the phase-plane structure may
explain a recent interesting result of Nichols (15), who found that if
a system similar to that considered here is analyzed by means of
Kochenburger’s method (11), a stable and an unstable limit cycle
are predicted under certain conditions, whereas in reality only a
stable limit cycle exists. This failing of the describing-function
method is probably due to the fact that it calls for “linearizing”
the nonlinearity of Fig. 15 in a fashion that does not
properly take into account the discontinuities. A safer
procedure would consist in considering additional dynamic effects,
such as inertia and resilience of the gearing, in which case the
nonlinearity becomes single-valued (16), but then it is necessary
to deal with a higher-dimensional phase space and the situation
will be appreciably more complicated.

(ii) r(t) ≠ 0. If the input is allowed to be a step, but not a
ramp, the preceding discussion remains valid with only trivial
changes in notation. If a ramp input is admitted, however, there
is an essential difference. In that case, the critical points will be
located at

. . . . [13]

This means that one of the critical points will be actual, the
other virtual. As a result, the limit cycle will be destroyed.
Physically, this is exactly what one should expect; if the gears
are always driven in one direction, the effects of backlash
disappear.

6 Instability From the Geometry of Phase Space


So far, the phase space of systems studied, excepting Example 
2, was a plane; i.e., two-dimensional and infinite in both dimen- 
sions. The next example is a system defined over a cylindrical 
phase space. The phase space is still two-dimensional but it is 
infinite only in one of the dimensions. 

Example 6. Consider a simple positional servomechanism
shown in Fig. 17. The distinguishing feature of this system is
that the output is detected by means of a continuous-rotation
potentiometer. In other words, after rotation by the angle 2π,
the wiper arm of the potentiometer returns to its original position.

Fig. 17 (block diagram; legible labels: servomotor, s(s + 1), continuous-rotation potentiometer)

To describe a nonlinearity of this type analytically and to set
up the corresponding differential equations would be somewhat
awkward. The following simple device, commonly used in
topology (19), eliminates the difficulty and permits simple
visualization of the problem. When |c| < π, the system obeys a
simple linear differential equation in the (c, ċ) phase plane. Now
the fact that rotation by c = 2π does not change the physical
situation can be expressed by identifying points on the line c =
−π with points on the line c = π having the same ċ-co-ordinate.
Such an identification is merely another way of stating that the
phase space is the (surface of the) infinite cylinder. The reader
may easily verify this state of affairs by cutting out an elongated
rectangular strip and pasting together, i.e., identifying the long
sides.
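
The identification of the lines c = −π and c = π can be written as a one-line mapping. This is a minimal sketch; the function name `wrap_to_cylinder` and the choice of the half-open interval [−π, π) as the representative range are assumptions made for illustration.

```python
import math


def wrap_to_cylinder(c):
    """Map a shaft angle c (radians) to its representative in [-pi, pi).

    Angles differing by a multiple of 2*pi describe the same wiper position,
    so they are sent to the same point on the cylinder.
    """
    return (c + math.pi) % (2.0 * math.pi) - math.pi


if __name__ == "__main__":
    # The two edges of the strip land on the same point.
    print(wrap_to_cylinder(math.pi), wrap_to_cylinder(-math.pi))
    # One full extra turn does not change the position.
    print(wrap_to_cylinder(0.5 + 2.0 * math.pi))
```

The velocity co-ordinate is left untouched by the mapping, which corresponds to pasting together points of the two edges that have the same ċ.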

Notice that the statement, “The phase space is cylindrical,” 
uniquely characterizes the nonlinearity involved, namely, the 
continuous-rotation potentiometer. A system defined over a 
cylindrical phase space is necessarily nonlinear. 

To analyze the system more closely, assume that the input
is a step of magnitude |r|. There is only one critical point at

c = r and ċ = 0 [14]

Depending on the loop gain K, the critical point may be a
stable node N⁺ or a stable focus F⁺. When the cylinder is cut
along c = π and its surface is spread out on the plane, all
trajectories are governed by a linear differential equation. There are
several interesting cases:

(i) |r| > π. In this case there exists no critical point, for the
point c = r, ċ = 0 cannot be found on the (surface of the)
cylinder. Of course, this is merely the result of stupid design; the
servo cannot be expected to work with such large inputs. The
problem of how the servo will behave is still interesting, however;
it is also easy to explore. Let the critical point be a stable node;
since it is located “outside” the phase space, it may be called a
virtual critical point and denoted by (N⁺). The corresponding
trajectories are shown in Fig. 18. A representative trajectory is
denoted by ABB′CC′DD′. The arc D′D is a limit cycle. Since
in the phase space all trajectories ultimately converge to the
eigenvector (4) ED, it is clear that there is only one limit cycle, D′D,
and that all trajectories tend to it. Physically, the limit cycle
corresponds to the motor driving the potentiometer always in
the same direction, but with a periodically repeated nonconstant
speed. The case when the virtual critical point is (F⁺) is
essentially the same.

The limit cycle may be visualized by imagining a rubber band 
stretched around the cylinder in such a fashion that if the cylinder 
is cut along the rubber band, it falls into two half-cylinders. The 
limit cycle D’D is called a limit cycle of the second kind (2-3) to dis- 
tinguish it from the limit cycle of the first kind, which arose in 
connection with Examples 1 through 5. If the cylinder were 
cut along a limit cycle of the first kind, the result would be a cir- 
cular patch and a cylinder with a hole in its surface. 
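
Case (i) can be explored with a short simulation. The sketch below is not the author's model: it assumes, from the legible "s(s + 1)" label of Fig. 17, a loop of the form c̈ + ċ = K·(r − c_measured), where c_measured is the shaft angle wrapped into [−π, π). The names `wrap`, `simulate`, `gain`, and all numerical values are hypothetical.

```python
import math


def wrap(c):
    """Wiper position in [-pi, pi) for an unwrapped shaft angle c."""
    return (c + math.pi) % (2.0 * math.pi) - math.pi


def simulate(r, gain, c0, v0, t_end=100.0, dt=1e-3):
    """Integrate c'' = -c' - gain*(wrap(c) - r) with semi-implicit Euler.

    Returns the final unwrapped angle and velocity. The model is an
    assumption made for illustration only.
    """
    c, v = c0, v0
    for _ in range(round(t_end / dt)):
        accel = -v - gain * (wrap(c) - r)
        v += accel * dt
        c += v * dt
    return c, v


if __name__ == "__main__":
    # |r| > pi: the error never changes sign, so the motor keeps turning.
    print(simulate(r=4.0, gain=0.2, c0=0.0, v0=0.0))
    # |r| < pi with an overdamped loop: the shaft settles at c = r.
    print(simulate(r=1.0, gain=0.2, c0=-3.0, v0=0.0))
```

With r = 4 the unwrapped angle grows without bound while the velocity stays positive, which on the cylinder is a closed path encircling it, i.e., a limit cycle of the second kind. With r = 1 and the same low gain the trajectory simply ends at the node.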

Fig. 18

TABLE 1

Principal cause of instability; Sufficient (entries as legible):
Root locus crosses over ... half plane Re(s) > ... for any incremental gain
System overdamped when operating linearly (low gain)
System equations are defined discontinuously
Multivalued nonlinearity
Geometry of phase space
(low gain) < π

(ii) |r| < π. If the critical point is N⁺, a glance at the
foregoing analysis shows that there can exist no limit cycle and that
all trajectories tend to the critical point. Suppose, however,
that the gain K is large and therefore the critical point is F⁺.
Furthermore, let the critical point be near one of the boundaries
of the phase space, as in Fig. 19. If the trajectory originates at
G near the critical point, then the transient is stable. This will
happen, for instance, when there is a slight increase in input from
r′ to r. But if the trajectory originates much farther away, say
at A, then it cannot spiral into the critical point without
intersecting the line c = π at B. Therefore the trajectory continues
from B′, etc., asymptotically approaching the limit cycle L′L.
All trajectories starting above the limit cycle, such as D′EE′...,
approach the limit cycle asymptotically from the other side.
The limit cycle is again of the second kind.

The region in which the trajectories converge to F⁺ is shown by
the crosshatching in Fig. 19. The boundary of the region is a
trajectory. On the cylinder, the crosshatched region appears as
a helical band. As the critical point approaches either π or −π,
the crosshatched region will shrink, becoming a point when
|r| = π. To assure stability for all |r| < π, it is desirable that
the critical point be N⁺. But if the range of the magnitude of
r is smaller than π, then somewhat less than critical damping can
be tolerated (see Table 1).

TABLE 1 (continued)

Conditions for stability (entries as legible):
No roots with Re(s) > 0; Not known. ... extremely complicated
Overdamped: only when r ≠ 0 and small;
Viscous damping of output element
|r| < π and overdamped; < π if K ...

It is interesting to observe from Fig. 19 that the point (±π, 0)
on the surface of the cylinder is a critical point; namely, a saddle
point. A new critical point has arisen when the differential
equation is defined on a new type of surface! This can be
explained very simply with the aid of topology. According to a
fundamental theorem of Poincaré, the sum of indexes⁸ of critical
points of a surface must be equal to the Euler characteristic χ of
the surface. The number χ = 1 for the plane if the system is
incrementally stable at every point sufficiently far from the origin;
under the same assumption (which is always true in Example 6),
χ(cylinder) = 0. Hence either (a) there are no critical points on
the cylinder or (b) the number of saddle points is equal to the
number of other types of critical points. Case (i) of this
example illustrates (a) and Case (ii) illustrates (b). A similar
situation exists if the differential equation is defined on the torus (i.e.,
when both phase-plane variables have the periodic property),
since χ(torus) = 0 always (19).
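
As a worked check of the index count, using the index values +1 for nodes and foci and −1 for saddle points, the two cases of Example 6 can be written out as follows. The symbol \operatorname{ind} is introduced here only for illustration.

```latex
\sum_i \operatorname{ind}(p_i) = \chi(\text{surface})

% Case (i), |r| > \pi: no critical points on the cylinder
0 = \chi(\text{cylinder})

% Case (ii), |r| < \pi: one node or focus at (r, 0), one saddle at (\pm\pi, 0)
(+1) + (-1) = 0 = \chi(\text{cylinder})
```

On the plane, by contrast, a single stable node or focus already gives the sum +1 = χ(plane), so no saddle point is required there.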

A trivial example of a nonlinear system defined over a
cylindrical phase space is the ordinary pendulum. The most common
example of engineering interest is the synchronous motor. No
doubt there exist many other systems, especially among those
connected with navigation and guidance problems, where one
(or more) dimension in the phase space is not infinite but
“cylindrical.” When that is the case, effects similar to those discussed
may be anticipated.

TABLE 1 (continued)  SUMMARY OF EXAMPLES

Conditions for stability: Necessary (entries as legible):
Likely to be
if K >

Nature of instability:
Stable and unstable limit cycles; paths to infinity
Relaxation oscillation (nonanalytic limit cycle)
Stable limit cycle
Stable limit cycle of second kind

Effect of input:
May alter stability of critical point or limit cycles
Sufficiently small ramp input creates limit cycle
Ramp input destroys limit cycle
Sufficiently large step input creates limit cycle


7 Instability Caused by Sampling

A new type of discontinuity appears when a control system
does not operate on continuous signals but receives information
intermittently, in “sampled” form. If all other elements of
the system are linear, sampling will not render it nonlinear;
in fact, the linear theory of sampled-data systems is now well
established (20). If there is some nonlinear element in the
system, for instance, a relay or a saturating amplifier, then the
interaction between the nonlinearity and the discontinuity due
to sampling may lead to very complex phenomena which are as
yet only incompletely understood. In view of the difficulty of the
problem, no discussion is attempted here and the reader is
referred to a forthcoming paper of the author (21).


⁸ The index of nodes and foci is +1; the index of a saddle point is
−1 (see references 2, 3).

Fig. 19


Conclusion


The salient features of the examples discussed in the foregoing 
are collected in Table 1, which requires no further comment. 

There is little doubt that such a table is potentially incomplete 
at the present time, and that it will be superseded in the future 
by a richer as well as better organized classification of principal 
nonlinear phenomena encountered in control systems. The 
author would be greatly appreciative of any communication as 
well as criticism of the table, particularly welcoming practical 
examples which appear to violate or to confirm the contents of 
the table. 


ACKNOWLEDGMENTS 


This research was supported in part by the U. S. Air Force
under Contract AF 30(635)-2815, Task No. III, 45360, Project
No. 4506. The paper has gained much from conversations of
the author with Dr. L. A. Zadeh and Dr. K. Klotter.


REFERENCES 


1 “Phase-Plane Analysis of Automatic Control Systems With
Nonlinear Gain Elements,” by R. E. Kalman, Trans. AIEE, vol. 73,
part II, 1954, pp. 383-390.

2 “Theory of Oscillations,” by A. A. Andronov and C. E.
Chaikin, translated by S. Lefschetz, Princeton University Press,
Princeton, N. J., 1949.

3 “Introduction to Nonlinear Mechanics,” by N. Minorsky,
J. W. Edwards, Ann Arbor, Mich., 1947.

4 “Analysis and Design Considerations of Second and Higher
Order Saturating Servomechanisms,” by R. E. Kalman, Trans.
AIEE, vol. 74, part II, 1955, pp. 294-310.

5 “Engineering Cybernetics,” by H. S. Tsien, McGraw-Hill
Book Company, Inc., New York, N. Y., 1954.

6 “Synthesis of Feedback Control Systems by Phase-Angle
Loci,” by Y. Chu, Trans. AIEE, vol. 71, part II, 1952, pp. 330-339.

7 “The Study of Transients in Linear Feedback Control Systems
by Conformal Mapping and the Root Locus Method,” by V. C. M.
Yeh, Trans. ASME, vol. 76, 1954, pp. 349-361.

8 “Phase-Plane Analysis—A General Method of Solution of
Two-Position Process Control,” by D. P. Eckman, Trans. ASME, vol.
76, 1954, pp. 109-116.

9 “A General Equation for Relaxation Oscillations,” by N.
Levinson and O. K. Smith, Duke Math. Journal, vol. 9, 1942, pp.
382-403.

10 “Limit Cycles and Rotated Vector Fields,” by G. F. D. Duff,
Annals of Mathematics, vol. 57, 1953, pp. 15-31.

11 “A Frequency-Response Method for Analyzing and
Synthesizing Contactor Servomechanisms,” by R. J. Kochenburger,
Trans. AIEE, vol. 69, 1950, pp. 270-284.

12 “The Use of Nonlinear Feedback to Improve the Transient
Response of a Servomechanism,” by J. B. Lewis, Trans. AIEE, vol.
71, part II, 1952, pp. 449-453.

13 “A Differential Analyzer Study of Certain Nonlinearly
Damped Servomechanisms,” by R. R. Caldwell and V. C. Rideout,
Trans. AIEE, vol. 72, part II, 1953, pp. 165-170.

14 “Operating Modes of a Servomechanism with Nonlinear
Friction,” by H. Lauer, Journal of The Franklin Institute, vol. 255,
1953, pp. 497-511.

15 “Backlash in a Velocity-Lag Servomechanism,” by N. B.
Nichols, Trans. AIEE, vol. 72, part II, 1953, pp. 462-467.

16 “Stability Characteristics of Closed-Loop Systems With
Dead Band,” by C. H. Thomas, Trans. ASME, vol. 76, 1954, pp.
1365-1382.

17 “Instrument Inaccuracies in Feedback Control Systems with
Particular Reference to Backlash,” by H. T. Marcy, M. Yachter,
J. Zauderer, Trans. AIEE, vol. 68, 1949, pp. 778-788.

18 “An Introduction to the Analysis of Nonlinear Closed-Cycle
Control Systems,” by W. E. Scott, “Automatic and Manual
Control,” edited by A. Tustin, Academic Press, New York, N. Y., 1951.

19 “Combinatorial Topology of Surfaces,” by R. C. James,
Math. Magazine, vol. 29, 1955, pp. 1-39.

20 “The Analysis of Sampled-Data Systems,” by J. R. Ragazzini
and L. A. Zadeh, Trans. AIEE, vol. 71, part II, 1952, pp. 225-234.

21 “Nonlinear Aspects of Sampled-Data Control Systems,” by
R. E. Kalman, presented at Symposium on Nonlinear Circuit
Analysis, Polytechnic Institute of Brooklyn, April 26, 1956; to be
published in the Proceedings of the Symposium.


Discussion 


R. W. Bass.⁹ This paper is an exceptionally fine contribution
to the subject. It abounds in deep insights and the author
displays an acute feeling for the mathematical questions involved.
In the writer’s opinion, the author’s rapid new method of
“virtual” and actual critical points provides one of the most valuable
techniques now available.

A promising aspect of this method is its applicability in n
dimensions (n > 2). But then if a trajectory is trapped between
(say) two concentric spherical surfaces with no critical point
available, one cannot assert (as in Figs. 5, 7) that the trajectory
will tend to a periodic orbit (or limit cycle). In fact, the
Poincaré-Bendixson theorem (1) does not hold for n > 2. It is
true, however, that (2) the trajectory will tend to a limit
trajectory which is recurrent in the sense of G. D. Birkhoff (3). A
recurrent orbit is nearly “almost periodic in the sense of Bohr.”
But a recurrent orbit represents unstable performance just as
much as does a periodic orbit. Consequently one can avoid
the mathematical subtleties (2) involved here, and proceed to
use the author’s method with success.

The writer’s criticism is that not all of the author’s statements 
are established in quite the rigorous style which is currently 
accepted as mathematical proof. However, the writer has 
supplied several such proofs and will publish them elsewhere. 

In particular, the theorem and corollary of Section 3.1 are near 
to special cases of an unpublished theorem which the writer had 
found previously and independently: 

Let x = (x₁, ..., xₙ) be a (column) n-vector, let |x| = (x₁² +
... + xₙ²)^(1/2), let A be an n × n matrix, and let I be the unit
matrix. Let the matrix exp (At) be defined as in references (4)
or (1). Then, if x represents the error (and its derivatives),
every control problem can be written

$$\dot x = A x + f(x) + g(t) \qquad [E_1]$$

where, if the reference variable and its derivatives are bounded,
|g(t)| ≤ m for all t ≥ 0. We assume that the vector f satisfies
f(0) = 0.

If f(x) saturates, then we can write f(x) = a(x)x, where the
matrix a satisfies

$$|a(x_0)\,x| \le \kappa\,|x| \qquad [I_1]$$

for all x and all x₀.
If ẋ = Ax is “stable,” i.e., if all characteristic roots of A have
negative real parts, then

$$|\exp(At)\,x_0| \le \gamma\,|x_0| \exp(-\lambda t) \qquad [I_2]$$

for all x₀, all t ≥ 0, and some λ > 0, γ > 0.
The author’s criterion is that ẋ = [A + a(x₀)]x also be stable
for every x₀. A sufficient condition for that is

$$\gamma\kappa < \lambda \qquad [I_3]$$

which we shall assume.
Under these hypotheses, every solution of [E₁] satisfies

$$|x(t)| \le \gamma\,\bigl[\,|x(0)| \exp(-\theta t) + m/\theta\,\bigr], \qquad \theta = \lambda - \gamma\kappa \qquad [I_4]$$

for all t ≥ 0. Hence any periodic solutions must satisfy

$$|x(t)| \le \gamma m/\theta \qquad [I_5]$$

The Inequalities [I₁] to [I₅] recapitulate most of linear and
quasilinear servo theory: For step inputs (m = 0) the error
|x(t)| → 0 as t → ∞; while for dynamic inputs the maximum
steady-state error depends on the maximum rate of change of
the input (measured by m), and the “amount” (θ/γ) of
inherent stability.

⁹ Department of Mathematics, Princeton University, Princeton,
N. J.

¹⁰ Numbers in parentheses refer to the References at the end of
this discussion.

Corollary: When m = 0, a necessary condition for instability
is |f(x₀)| ≥ (λ/γ)|x₀| for some x₀.
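
The bound on |x(t)| can be checked numerically in the simplest scalar case. The sketch below assumes that the bound reads |x(t)| ≤ γ[|x(0)| exp(−θt) + m/θ] with θ = λ − γκ, and takes A = −λ (so that γ = 1), f(x) = κ tanh x (which satisfies |f(x)| ≤ κ|x|), and g(t) = m sin t. These choices, the function name `worst_margin`, and all numerical values are illustrative assumptions, not the discusser's example.

```python
import math


def worst_margin(lam=2.0, kappa=0.5, m=1.0, x0=3.0, t_end=20.0, dt=1e-3):
    """Largest value of |x(t)| - bound(t) along a simulated trajectory.

    Scalar model (assumed): x' = -lam*x + kappa*tanh(x) + m*sin(t).
    Bound (assumed form):   |x0|*exp(-theta*t) + m/theta, theta = lam - kappa.
    A negative return value means the trajectory stayed below the bound.
    """
    theta = lam - kappa  # gamma = 1 for the scalar case A = -lam
    if theta <= 0.0:
        raise ValueError("the sufficient condition gamma*kappa < lam is violated")
    x = x0
    worst = -math.inf
    for k in range(round(t_end / dt) + 1):
        t = k * dt
        bound = abs(x0) * math.exp(-theta * t) + m / theta
        worst = max(worst, abs(x) - bound)
        # Forward-Euler step of the assumed scalar model.
        x += dt * (-lam * x + kappa * math.tanh(x) + m * math.sin(t))
    return worst


if __name__ == "__main__":
    print(worst_margin())          # bounded input, m = 1
    print(worst_margin(m=0.0))     # step-input case, error decays to zero
```

A single trajectory staying under the bound proves nothing in general; the sketch only makes the roles of λ, κ, m, and θ concrete.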

We consider next whether the author’s piecewise-linear
approximations are valid. Kaplan (5) already has proved that
frequently one may establish all the qualitative features of the
trajectories (the “phase portrait”) by examining the field of
directions defined by the differential equation at a finite (but
“sufficiently large”) number of points in the phase space.

Concerning the author’s procedure itself, the writer has proved 
the following theorem: 

Let n = 2; let the 2-vector function h(x) = [h₁(x₁, x₂), h₂(x₁,
x₂)] satisfy a Lipschitz condition. Suppose that

$$\dot x = h(x) \qquad [E_2]$$

is “dissipative,” i.e., that there is a circular domain D: x₁² +
x₂² < R² which all trajectories eventually enter. Suppose that
h(0) = 0 but that |h(x)| ≠ 0 for |x| ≠ 0. Then D can be
decomposed into the union of certain (“sufficiently small”) closed
regions Dᵢ (i = 1, ..., N) to which corresponds a piecewise-linear,
continuous vector function

$$\dot x = A_i x \quad \text{for } x \text{ in } D_i \quad (i = 1, \ldots, N) \qquad [E_3]$$

which is such that the phase portraits of [E₂] and [E₃]
are topologically equivalent (with a single possible exception).

The exception: If [E₂] has one or more semistable (i.e.,
stable-unstable) limit cycles, they may not occur in [E₃]. The author’s
discussion reminds us that stable limit cycles can be observed
directly, and unstable limit cycles, indirectly. But (because of
“noise”) a semistable limit cycle cannot be observed
empirically (or graphically) either directly or indirectly. Hence such
omissions in the author’s approximations [E₂] → [E₃] are of no
significance for engineering.

In reply to the author’s concluding question, the writer refers
to an analysis [(6), summarized in (7), (8)] of the Example 4, in
which he neglects coulomb friction, but considers relay dead
zone, hysteresis, and time delay. As these three parameters are
varied, one can obtain both the author’s Fig. 5 and Fig. 7. Since
the corresponding ideal system is stable for step inputs, the
writer would make a sixth row in Table 1 for this example, listing
as the principal cause of instability not “discontinuity” but
“system delays.” [Incidentally, a new method, based on
reference (9), for eliminating such delays is given in (7), (8).] Indeed,
delays are a very important source of instability. They could well
be listed separately from row three in Table 1, since, e.g.,
hysteresis does not alter the phase-portrait’s independence of time.


REFERENCES 


1 “Lectures on Differential Equations,” by S. Lefschetz,
Princeton University Press, Princeton, N. J., 1946.

2 “Topological Dynamics,” by W. H. Gottschalk and G. A.
Hedlund, American Mathematical Society Colloquium Publication,
vol. 36, 1955.

3 “Dynamical Systems,” by G. D. Birkhoff, American
Mathematical Society Colloquium Publication, vol. 9, 1927.

4 “Stability Theory of Differential Equations,” by R. Bellman,
McGraw-Hill Book Company, Inc., New York, N. Y., 1953.

5 “Dynamical Systems With Indeterminacy,” by W. Kaplan,
American Journal of Mathematics, vol. 72, 1950, pp. 573-594.

6 “The Analysis and Synthesis of Relay and Nonlinear
Servomechanisms,” by R. W. Bass, The Johns Hopkins University
Institute for Cooperative Research, 1955.

7 “Improved On-Off Missile Stabilization,” by R. W. Bass,
American Rocket Society Preprint No. 285-56; submitted to Jet
Propulsion.

8 “Equivalent Linearization, Nonlinear Circuit Synthesis, and
the Stabilization and Optimization of Control Systems,” by R. W.
Bass, to be published in Proceedings of the Symposium on Nonlinear
Circuit Analysis, MRI Symposia Series, vol. 6, October, 1956.

9 “A Generalization of the Functional Relation Y(t + s) =
Y(t) Y(s) to Piecewise-Linear Difference-Differential Equations,” by
R. W. Bass, to appear in the Quarterly of Applied Mathematics.


Dunstan Graham.¹¹ It has long been accepted that analyses
of nonlinear control systems are restricted to a very limited
degree of generality. This paper, in which the author classifies
and analyzes nonlinear systems in general terms, represents a
fresh approach, the importance of which cannot be
overemphasized.

Linear-system design has long profited by the “feel” which
the engineer can obtain for the problems which he must attack.
This paper is the beginning of a feel for nonlinear problems which
will enable the engineer to attack them much more successfully.


K. Klotter.¹² The writer wishes to comment briefly on
three points of the author’s very able presentation. All of the
comments pertain either to nomenclature or to definitions:


1 The author has introduced the concept of “virtual critical
(or singular) points” as contrasted to “real critical points” and
he should be congratulated on this very fortunate designation.
The designation seems fortunate because the concept is in
immediate and complete parallelism to such well-known and
long-established concepts as “virtual images” in geometrical optics,
“virtual nodes” in theory of torsional vibrations of engine shafts,
and many others.

2 Less fortunate, in the writer’s opinion, is another expression
coined by the author, the designation of a nonlinear system as
“strongly nonlinear” if it is capable of self-sustained oscillations,
and (by implication) of “weakly nonlinear” if it is not capable of
self-sustained oscillations. The choice of words for describing
these two cases seems not so good, because “strong” or “weak”
ordinarily carry connotations of quantity rather than quality.
It is a distinction of quality, however, which the author intends
to make. If one would adopt the author’s designations, a system
described by van der Pol’s differential equation

$$\ddot q - \epsilon\,\dot q\,(1 - \alpha^2 q^2) + \kappa^2 q = 0$$

would have to be called “strongly nonlinear” however small the
coefficient ε of the nonlinear term may be, whereas a system
described by a differential equation like the following

$$\ddot q + \ldots + \kappa^2 q\,(1 + \ldots) = 0$$

would have to be called weakly nonlinear however large the
coefficient of the nonlinear term may be. It seems obvious that
such designations will tend to create confusion or at best stand in
permanent need of specific and detailed explanations in order to
eradicate connotations which come naturally to a reader’s or
listener’s mind.
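
The point about van der Pol's equation can be illustrated numerically: even with a small coefficient ε, the oscillation grows from a small disturbance and settles on a limit cycle of finite amplitude. The sketch below assumes the standard normalized form q̈ − ε(1 − q²)q̇ + q = 0 (all other constants set to 1); the function name `van_der_pol_amplitude` and the numerical values are hypothetical.

```python
def van_der_pol_amplitude(eps, q0=0.1, t_end=200.0, dt=0.01):
    """Peak |q| over the last 10 time units of a van der Pol trajectory.

    Assumed normalized form: q'' - eps*(1 - q*q)*q' + q = 0,
    integrated with the classical fourth-order Runge-Kutta method.
    """
    def deriv(q, v):
        return v, eps * (1.0 - q * q) * v - q

    q, v = q0, 0.0
    steps = round(t_end / dt)
    tail = round(10.0 / dt)  # longer than one period (about 2*pi)
    amplitude = 0.0
    for k in range(steps):
        k1q, k1v = deriv(q, v)
        k2q, k2v = deriv(q + 0.5 * dt * k1q, v + 0.5 * dt * k1v)
        k3q, k3v = deriv(q + 0.5 * dt * k2q, v + 0.5 * dt * k2v)
        k4q, k4v = deriv(q + dt * k3q, v + dt * k3v)
        q += dt * (k1q + 2.0 * k2q + 2.0 * k3q + k4q) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        if k >= steps - tail:
            amplitude = max(amplitude, abs(q))
    return amplitude


if __name__ == "__main__":
    # A weak nonlinear term still produces a sustained oscillation.
    print(van_der_pol_amplitude(eps=0.1))
```

Smaller values of ε lead to the same kind of result but need a longer run, since the approach to the limit cycle takes a time of order 1/ε.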

Instead of the quantitative designations strongly nonlinear or
weakly nonlinear, qualitative designations like “essentially
nonlinear” or “intrinsically nonlinear” or the like and their opposites
may be better. However, those expressions also will stand in
need of repeated definition and explanation until they may
become generally accepted terms. On the other hand, one may ask
whether there is indeed a necessity for coining new expressions
for those cases. Would “capable of sustained oscillations” and
“not capable of sustained oscillations” not be sufficient to
describe the difference the author wants to emphasize?

¹¹ Section Head, Automatic Flight Controls, Lear, Inc., Grand
Rapids Division, Grand Rapids, Mich.

¹² Professor of Engineering Mechanics, Stanford University,
Stanford, Calif. Mem. ASME.

3 The last comment pertains to the author’s definition of a
critical point as unstable if all the real parts of the characteristic
roots associated with the critical point are non-negative. This
definition compels the author to call a saddle point “neither
stable nor unstable.”

The author’s definition is in strict and direct contradiction to 
all usages known to the writer, where a system is called unstable 
if the real part of even one of the characteristic roots is non- 
negative (see e.g., the Routh-Hurwitz criteria). In this termi- 
nology, a saddle point then is to be called unstable. 

It seems highly undesirable to introduce definitions which 
clash with others that have been used for a long time. 


Author’s Closure


The author is grateful for the lively and constructive criticisms 
the paper has produced. 

Dr. Bass’ remark regarding mathematical rigor is worth addi- 
tional emphasis. Because the behavior of nonlinear systems may 
present exceptionally subtle aspects, any conclusions in analyzing 
them should be arrived at as rigorously as possible. Only rigo- 
rously obtained results can be used as a sure starting point for 
further investigations. On the other hand, excessive preoccupa- 
tion with complete rigor tends to put road blocks in the develop- 
ment of original ideas which, in their crudest form, always stem 
from intuitive considerations. 

In so far as the present paper is concerned, the principal point
where additional rigor is needed is the statement, introductory to
the theorem in Section 3.1, that “. . . a nonlinearity may be
approximated with a sufficiently large number of straight-line
segments so that essential features of the root locus are preserved.”
More precisely, this means that the topological aspects of the
solution of a differential equation are preserved whenever the
nonlinearity is replaced by a suitably chosen straight-line
approximation. The proof of this conjecture is trivial in the case of a
first-order differential equation (22).¹³ Dr. Bass’ theorem stated in the
second part of his discussion disposes of most of the interesting
cases when the differential equation is of the second order.
However, there are some unsettled questions when the differential
equation is higher than second order. The conjecture has strong
intuitive appeal, but it is important to have a rigorous proof also,
particularly to be able to recognize certain degenerate cases when
it may not be true.

The examples in the paper after the first one do not make use of 
this basic conjecture but deal with straight-line-type nonlineari- 
ties only. Under these circumstances the statements made in 
connection with the various examples can be proved rigorously 
(using mainly geometric facts concerning linear trajectories) when 
the differential equation is of the second order; in many cases, the 
discussion had to be greatly abbreviated for lack of space. The 
details can be easily supplied by the reader after some acquain- 
tance with the method. When the differential equations are of 
higher than the second order, the procedure is much more dif- 
ficult and not yet fully worked out, although several examples 
have been considered in detail by the author in reference (4) of 
the paper. 

As mentioned in the paper, a rigorous proof of the theorem of 
Section 3.1 is apparently not yet available. It is possible, how- 
ever, to have somewhat less general theorems concerning mono- 
stability. An example of these is the theorem stated by Dr. Bass 
in connection with the system [F,]. Since it applies only to cases 


13 Numbers in parentheses refer to additions to the References of
the paper and appear at the end of this closure.




where the sign of the components of the vector function f is imma- 
terial, the practical applicability of the theorem is severely re- 
stricted. By certain modifications it is possible largely to remove 
this limitation (23, 24). At any rate, theorems of this type 
(which guarantee monostability in a nonlinear system regardless of 
the order of the differential equation) have much potential engi- 
neering usefulness and provide a fertile area for additional work. 

The influence of delays, mentioned by Dr. Bass, was not con- 
sidered since the paper was restricted to differential equations. 
Practically speaking, delays are introduced in the analysis of con- 
trol systems because impulse response of linear parts of the system 
frequently can be more conveniently approximated by a delay 
term $e^{-sT}$ and a rational transfer function of low degree than by a
rational transfer function of high degree. Since physical systems 
seldom behave corresponding to a mathematically exact delay, 
the whole matter concerns the semi-empirical problem of replacing 
a physical system by an idealized mathematical model. Since 
very little is known about behavior of nonlinear delay-differential 
equations, the use of time delay for model building purposes in the 
nonlinear case is a highly treacherous undertaking. This may be 
illustrated by the celebrated case of the Goodwin business-cycle 
model. 

This model (25) is as follows: 


$$\dot{y}(t) + \sigma\, y(t) = \phi[\dot{y}(t - T)] \qquad [Es]$$


where $y(t)$ = national income, $T$ = time delay between investment
decisions and corresponding outlays, $\sigma$ = positive constant,
and $\phi$ = nonlinear induced-investment function, similar to the
straight-line approximation in Fig. 3 of the paper.

To be able to analyze this system, Goodwin had to expand the
term $\dot{y}(t - T)$ in a Taylor series. Moreover, he had to stop at the
second term in the Taylor series in order not to get a differential 
equation of higher than second order. Under these circumstances 
Goodwin demonstrated the existence of a unique, stable-limit 
cycle. However, using the approximate method of equivalent 
linearization, Bothwell (26) has shown that if the delay term is not 
approximated by the Taylor series there will be an infinite num- 
ber of limit cycles with monotonically decreasing period. In fact, 
there will be roughly as many limit cycles as the number of terms 
retained in the Taylor expansion. Although Bothwell’s method 
was not rigorous, there is little doubt about its validity; the same 
conclusions have been checked experimentally by means of an 
analog computer (27). Of course, this destroys the original intent 
of Goodwin to find a simple explanation for the business cycle 
using a nonlinear model. 
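
The truncation step can be written out explicitly. As a sketch only, let $u(t)$ be a generic, sufficiently smooth function (a symbol introduced here for illustration, standing for whatever quantity is delayed in the model). Its Taylor expansion about $t$, and the two-term truncation, are:

```latex
u(t - T) = \sum_{j=0}^{\infty} \frac{(-T)^{j}}{j!}\,\frac{d^{j}u}{dt^{j}}(t)
         = u(t) - T\,\dot{u}(t) + \frac{T^{2}}{2}\,\ddot{u}(t) - \cdots ,
\qquad
u(t - T) \approx u(t) - T\,\dot{u}(t) \quad \text{(two terms retained)} .
```

Each further term that is retained raises by one the highest derivative appearing in the model, so the number of retained terms is a modeling choice rather than a property of the delay itself.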

The lesson to be learned is this: Since the empirical or theoreti- 
cal justification of using a delay term in the foregoing system is no 
better than, say, using two terms in the Taylor series, it is prob- 
ably preferable not to use delay to approximate the response of 
physical systems. If a Taylor series approximation to the delay is 
used, as in Example 1 of the paper, the instability would fall under 
the first line in Table 1.

Professor Klotter’s second comment refers to terminology used 
in the earlier (typewritten) version of the paper. As a result of 
his recommendation made orally during the Princeton conference, 
the terms “weakly” and “strongly nonlinear” have been replaced
in the present version of the paper by “monostable” and “not
monostable,” respectively. The present wording (suggested by
L. A. Zadeh) is motivated by analogous designations used in elec- 
trical engineering practice in connection with multivibrators. 
Specifically, a monostable multivibrator has one stable equi- 
librium point, a bistable multivibrator has two, while an astable 
multivibrator has a single, stable, limit cycle. 

The terms “not capable of sustained oscillations” or “capable
of sustained oscillations” would not be adequate to cover fully the
situation because of the possibility of more than one stable equilibrium
point; moreover, in certain cases, the steady-state behavior
of a nonlinear dynamic system cannot even be characterized
by the phrases “there are one or more stable equilibrium
points and one or more stable limit cycles.” The so-called “ergodic”
case of solutions of a differential equation defined on the
torus falls in this category (28). There are also practical cases
where the behavior of a sampled-data system (governed by a nonlinear
difference equation) should be described in the language of
probability theory (29). Thus the word “monostable” is desirable
to describe the situation where the solutions of a nonlinear
differential equation are topologically equivalent to solutions of
a stable linear differential equation.

Professor Klotter’s third comment touches on matters which 
could not be discussed in the paper because of space limitations. 
The words “stable” and “unstable” are commonly used for overall
characterization of the behavior of linear autonomous systems.
Unfortunately, it appears impossible at present to give a
really satisfactory definition of stability in the nonlinear case, be- 
cause of the tremendous range of possibilities which must be con- 
sidered (30). In the author’s view, the use of these concepts in con- 
nection with nonlinear systems should be avoided if not altogether 
excluded because of their inherent ambiguity. 

It is sometimes suggestive, however, to use “stable” and “unstable”
in a much narrower sense. In particular, the terminology
“stable focus” or “unstable focus,” and so on, is motivated by the
fact that all solutions starting in a small neighborhood of a stable
focus converge to it as $t \to +\infty$, while all solutions converge to an
unstable focus as $t \to -\infty$. Hence there is a sort of complementarity
between stable and unstable focus, corresponding to a
change in sign of $t$. On the other hand, topological aspects of the
behavior near a saddle point do not change as $t$ is replaced by $-t$.
For this reason, it does not make sense to talk about stable or unstable
saddle points. The terminology is intended to accentuate
the difference between foci and nodes on the one hand, and
saddle points on the other, which is all important in nonlinear
problems. In fact, saddle points are hardly differentiated from unstable
foci or nodes in the discussion of linear systems, since they
are all “unstable” (in Professor Klotter’s terminology). The fact
remains that our present-day usage in connection with linear
systems is much too oversimplified to accommodate the manifold
distinctions which must be recognized in dealing with nonlinear
systems. The situation is perhaps not unlike concepts of grammar
which are difficult to grasp clearly when studied from the
point of view of the English language which has an oversimplified
structure, while the same concepts of grammar appear to be much
clearer when studied in relation to a more complex language, such
as German or Greek.
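
The time-reversal argument can be checked on linearized equations. As a sketch, assume a second-order linear system with matrix $A$ and real or complex characteristic roots $\lambda_1, \lambda_2$ (symbols introduced here for illustration only). Replacing $t$ by $-t$ changes the sign of every root:

```latex
\frac{dx}{dt} = A x, \quad t \mapsto -t
\;\Longrightarrow\;
\frac{dx}{dt} = -A x, \qquad
(\lambda_1, \lambda_2) \mapsto (-\lambda_1, -\lambda_2).

\text{Focus or node: } \operatorname{Re}\lambda_1,\ \operatorname{Re}\lambda_2 < 0
\;\longmapsto\; \operatorname{Re}(-\lambda_1),\ \operatorname{Re}(-\lambda_2) > 0
\quad \text{(stable and unstable are interchanged)}.

\text{Saddle point: } \lambda_1 < 0 < \lambda_2
\;\longmapsto\; -\lambda_2 < 0 < -\lambda_1
\quad \text{(one root of each sign, still a saddle point)}.
```

Under these assumptions the focus and node change character when time is reversed, while the saddle point does not, which is the distinction the terminology is meant to preserve.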


REFERENCES 

22 Appendix A, reference (21). 

23 Reference (8) of Dr. Bass’ discussion. 

24 Oral communication by Dr. Bass. 

25 “The Nonlinear Accelerator and the Persistence of Business
Cycles,” by R. M. Goodwin, Econometrica, vol. 19, 1951, pp. 1-17.

26 “The Method of Equivalent Linearization,” by F. E. Bothwell,
Econometrica, vol. 20, 1952, pp. 269-283.

27 “Goodwin’s Nonlinear Theory of the Business Cycle: An
Electro-Analog Solution,” by R. H. Strotz, J. C. McAnulty, and J. B.
Naines, Jr., Econometrica, vol. 21, 1953, pp. 390-411.

28 “Theory of Ordinary Differential Equations,” by E. A. Coddington
and N. Levinson, McGraw-Hill Book Company, Inc., New York,
N. Y., 1955.

29 Theorem 5, reference (21).

30 “On the Stability of Mechanical Systems,” by J. J. Stoker,
Communications on Pure and Applied Mathematics, vol. 8, 1955, pp.
133-142.




Determination of the Characteristics of 
Multi-Input and Nonlinear Systems 
From Normal Operating Records 


By T. P. GOODMAN,2 CAMBRIDGE, MASS.


A method is presented for discovering, from the random 
variations in normal operating records, the impulse re- 
sponses, or weighting functions, relating an output of a 
system to two or more inputs that are mutually corre- 
lated. This method makes use of the statistical autocorre- 
lation and cross-correlation functions of the system inputs 
and outputs. From these correlation functions, the 
weighting functions are obtained by a process of decon- 
volution by means of an electronic delay-line synthesizer. 
The method is described first for linear systems and is then 
applied to nonlinear systems. It is an extension of a 
method presented in an earlier paper for discovering the 
weighting function of a linear system with one input. 


INTRODUCTION 


IN designing, improving, or evaluating the automatic control
of a physical system or process, the characteristics of the
system must be known in such a form that the output for a 
given input can be determined. The system characterization to 
be used in the present study is the impulse response, or weighting 
function, of the system; that is, the response of the system to a 
unit impulse input. In a linear system, the step response may be 
obtained by integrating the impulse response, and the frequency 
response may be obtained by taking the Fourier transform of the 
impulse response (1). Heretofore the usual way of obtaining 
these responses has been to interrupt the operation of the system 
and subject it to the appropriate artificial disturbances (impulses, 
step functions, or sinusoids). It has been shown recently (2, 3, 4) 
however, that the same information can be obtained by statistical 
correlation of the random variations normally present in the 
system’s inputs and outputs, without subjecting the system to any 
artificial disturbances. 
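
The two relations quoted above for a linear system (step response by integration, frequency response by Fourier transformation) can be illustrated numerically. The following is a minimal sketch, assuming the impulse response is available as samples spaced T apart and already multiplied by T, so that sums approximate integrals; the function names and sample values are hypothetical.

```python
import cmath


def step_response(h):
    """Running sum of the weighted samples h_n = T * h(nT).

    The running sum approximates the integral of the impulse response,
    which is the step response of a linear system.
    """
    total = 0.0
    out = []
    for h_n in h:
        total += h_n
        out.append(total)
    return out


def frequency_response(h, omega, T):
    """Approximate the Fourier transform of the impulse response.

    H(j*omega) is approximated by sum_n h_n * exp(-j * omega * n * T).
    """
    return sum(h_n * cmath.exp(-1j * omega * n * T) for n, h_n in enumerate(h))


# Hypothetical three-term weighting sequence, sampling interval T = 1.
h = [0.5, 0.3, 0.2]
print(step_response(h))
print(abs(frequency_response(h, 0.0, 1.0)))
```

At zero frequency the sum reduces to the final value of the step response, which is a quick consistency check between the two computations.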

In an earlier paper (3), a statistical method was described for 
obtaining the weighting function of a linear system with a single 
input and a single output. Experimental results (3, 4) indicated 
that the method had sufficient promise to justify further develop- 
ment in two directions: 


1 Additional experience in practical application of the method.

2 Extension of the theory to make it applicable to more complex
systems.

The present paper is a contribution in the second direction. It
is planned to obtain additional experience in the practical application
of the method as rapidly as data from industrial processes
become available for study.

1 Based on a portion of a thesis undertaken in partial fulfillment of
the requirements for the degree of Doctor of Science in Mechanical
Engineering at Massachusetts Institute of Technology, Cambridge,
Mass. This investigation was supported in part by a research grant
from the Department of Mechanical Engineering, Massachusetts Institute
of Technology.

2 Assistant Professor of Mechanical Engineering, Massachusetts
Institute of Technology. Assoc. Mem. ASME.

3 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, February
3, 1956. Paper No. 56-IRD-17.

In the present paper, the statistical method for obtaining the 
weighting function of a single-input linear system is briefly re- 
viewed, and the method then is extended to provide a straight- 
forward way of dealing with systems with multiple inputs that are 
mutually correlated. It is shown that when the method is applied 
to a nonlinear system, the weighting functions obtained are those 
of the optimum linear representation (in a mean-square sense) of 
the system. The application of linear representations in design is 
briefly discussed. Some methods for obtaining the frequency-response
functions corresponding to the weighting functions of a
system are suggested.


REVIEW OF METHOD FOR SINGLE-INPUT SYSTEMS


The dynamics of a linear system with a single input $m(t)$ and
a single output $c(t)$, shown in Fig. 1, are characterized completely
by the system’s impulse response, or weighting function. The
weighting function is a continuous function $h(t)$, but for convenience
in computation, the area under the curve of $h(t)$ can be
approximated by a sequence of thin rectangles, each of width $T$.
Then $h(t)$ for any stable system can be denoted by a finite time
sequence

$$h(t) \approx [\,T h(0),\ T h(T),\ T h(2T),\ \ldots,\ T h(kT)\,] \qquad [1]$$

where the general term $h_n = T h(nT)$ is equal to $T$ times the response at time
$t$ to a unit impulse at time $(t - nT)$.

Fig. 1  Single-Input, Single-Output System (input m(t), output c(t))

As shown in reference (3), the output of the system for any input
may be approximated in terms of the weighting function as

$$c(t) = \sum_{n=0}^{k} h_n\, m(t - nT) \qquad [2]$$
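
The weighted sum in Equation [2] is a discrete convolution and can be sketched in a few lines. This is an illustration only: time is measured in units of T, the function name `dls_output` and the sample values are hypothetical, and input samples before the start of the record are assumed to be zero.

```python
def dls_output(h, m, t):
    """Approximate c(t) = sum_{n=0}^{k} h[n] * m(t - n), with time in units of T.

    h : weighting-sequence terms h_0 .. h_k (each h_n = T * h(nT))
    m : input samples m(0), m(T), m(2T), ...
    t : integer sample index at which the output is wanted
    Samples outside the record are treated as zero (an assumption).
    """
    total = 0.0
    for n, h_n in enumerate(h):
        if 0 <= t - n < len(m):
            total += h_n * m[t - n]
    return total


# Hypothetical weighting sequence and input record.
h = [0.5, 0.3, 0.2]
m = [1.0, 0.0, 0.0, 2.0]
c = [dls_output(h, m, t) for t in range(len(m))]
print(c)
```

The first three outputs simply reproduce the weighting sequence, because the record begins with a unit sample followed by zeros.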


In this paper the assumption is made that the systems dealt 
with are time invariant; that is, that their weighting functions 
do not change with time. The method described in this paper can- 
not be applied usefully to time-variant systems unless the time 
variations are slow compared with the length of the input and out- 
put records required to determine the weighting functions. 




The process of determining the weighting function of a system 
from normal operating records of its input and output is facilitated 
greatly by using the autocorrelation of the input 


$$\phi_{mm}(\tau) = \overline{m(t)\, m(t + \tau)} \qquad [3]$$

and the cross-correlation of input with output

$$\phi_{mc}(\tau) = \overline{m(t)\, c(t + \tau)} \qquad [4]$$


where the bar denotes averaging over all available records. 
When both sides of Equation [2] are correlated with m(t) 


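$$\overline{m(t)\, c(t + \tau)} = \sum_{n=0}^{k} h_n\, \overline{m(t)\, m(t + \tau - nT)}$$

or

$$\phi_{mc}(\tau) = \sum_{n=0}^{k} h_n\, \phi_{mm}(\tau - nT) \qquad [5]$$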
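
The averages that define the correlation functions can be estimated from sampled records. The sketch below assumes records sampled at interval T, with the lag given as an integer number of samples; the function name `correlation` is hypothetical, and the average is taken only over the samples where both records overlap.

```python
def correlation(a, b, lag):
    """Estimate phi_ab(lag) as the average of a(t) * b(t + lag).

    a, b : sampled records of equal sampling interval
    lag  : integer lag in samples; may be negative
    Only sample pairs that fall inside both records are averaged.
    """
    products = [a[t] * b[t + lag] for t in range(len(a)) if 0 <= t + lag < len(b)]
    return sum(products) / len(products)


# Tiny hypothetical records, chosen so the averages are easy to check by hand.
m = [1.0, 2.0, 3.0]
c = [4.0, 5.0, 6.0]
print(correlation(m, m, 0))   # autocorrelation at zero lag
print(correlation(m, c, 1))   # cross-correlation at a lag of one sample
```

With so few overlapping samples at large lags the estimate becomes unreliable, which is one reason long records are needed in practice.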
As described in reference (3) the advantages of using the 
correlation functions, rather than the input-output records them- 
selves, to determine the weighting function of a linear system are: 


1 The effects of noise are largely eliminated, even when the 
system is part of a closed loop. 

2 The autocorrelation of a nonperiodic input has a high central
peak and approaches a steady-state value for large positive
and negative values of $\tau$; hence Equation [5] is easier to solve for
the weighting function $h_n$ than is Equation [2].


The process of determining the weighting function $h_n$ from
Equation [5] is known as deconvolution. This may be done
numerically by solving a matrix equation obtained by writing
Equation [5] for $k + 1$ values of $\tau$. Deconvolution is facilitated
greatly, however, by an electronic device (3) known as the “delay-line
synthesizer” (DLS). The DLS of Fig. 2 consists of a tapped
electronic delay line with 20 equal time delays each of amount $T$,
coefficient multipliers by which each delayed input is multiplied
by the appropriate $h_n$, and a summing panel. Thus, when $m(t)$ is
used as the input voltage to the DLS and the coefficients $h_n$ are
set on the multipliers, the output voltage is $c(t)$, as indicated by
Equation [2]. Similarly, when $\phi_{mm}(\tau)$ is used as the input, and
the same coefficients $h_n$ are used, the output is $\phi_{mc}(\tau)$, as indicated
by Equation [5]. Conversely, to solve for the $h_n$ when $\phi_{mm}(\tau)$ and
$\phi_{mc}(\tau)$ are known, $\phi_{mm}(\tau)$, as obtained from a function generator,
is used as the input, and the coefficient settings are adjusted until
the output is matched with $\phi_{mc}(\tau)$ as closely as possible. The
weighting function then may be obtained either by reading off the
coefficient settings or by experimentally measuring the impulse
response of the DLS itself. If the delays obtained by the DLS
were perfect, both of these methods would give the same results
because the impulse response of the DLS would be a sum of delayed
impulses. Since the delays are not perfect, the actual impulse
response of the DLS is a sum of rounded pulses, giving a
continuous curve with interpolation between the discrete values
$h_n$. For this reason the experimental impulse response of the
DLS is a closer approximation than the sequence of coefficient
settings $h_n$. For simplicity, however, the equations of the following
discussion are written in terms of the discrete form of the impulse
response.

Fig. 2  Block Diagram of Delay-Line Synthesizer (DLS)
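
The numerical alternative mentioned above, writing Equation [5] for k + 1 values of the lag and solving the resulting matrix equation, can be sketched as follows. This is not the DLS procedure itself but a digital stand-in for it; the helper names, the exponential autocorrelation, and the "true" weighting sequence are all hypothetical, and lags are in units of T.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the caller's data is left untouched.
    aug = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def deconvolve(phi_mm, phi_mc, k):
    """Solve phi_mc(i) = sum_n h_n * phi_mm(i - n) for h_0 .. h_k."""
    matrix = [[phi_mm(i - n) for n in range(k + 1)] for i in range(k + 1)]
    rhs = [phi_mc(i) for i in range(k + 1)]
    return solve_linear(matrix, rhs)


# Hypothetical input autocorrelation with a central peak, decaying with lag.
def phi_mm(lag):
    return 0.5 ** abs(lag)


TRUE_H = [0.5, 0.3, 0.2]  # weighting sequence used to build the test data


def phi_mc(lag):
    # Cross-correlation generated from Equation [5] itself.
    return sum(h_n * phi_mm(lag - n) for n, h_n in enumerate(TRUE_H))


h = deconvolve(phi_mm, phi_mc, k=2)
print(h)
```

Because the cross-correlation here is generated exactly from Equation [5], the solution reproduces the assumed weighting sequence to rounding error; with measured correlations the match would only be approximate.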


SYSTEMS WITH TWO INPUTS


For a linear system with two inputs and one output, as diagrammed
in Fig. 3, there are two weighting functions, which may
be denoted by $g(t)$ and $h(t)$, such that

$$z(t) = \sum_{n=0}^{k} [\,g_n\, x(t - nT) + h_n\, y(t - nT)\,] \qquad [6]$$

Fig. 3  Two-Input, Single-Output System (inputs x(t) and y(t), weighting functions g(t) and h(t), output z(t))


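To aid in the solution of Equation [6], both sides of the equation
may be correlated, first with $x(t)$ and then with $y(t)$

$$\phi_{xz}(\tau) = \sum_{n=0}^{k} [\,g_n\, \phi_{xx}(\tau - nT) + h_n\, \phi_{xy}(\tau - nT)\,] \qquad [7]$$

$$\phi_{yz}(\tau) = \sum_{n=0}^{k} [\,g_n\, \phi_{yx}(\tau - nT) + h_n\, \phi_{yy}(\tau - nT)\,] \qquad [8]$$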


A comparison of Equations [7] and [8] with Equation [5]
shows that while Equation [5] can be solved numerically by inverting
a $(k + 1) \times (k + 1)$ matrix, the solution of Equations [7]
and [8] requires a $2(k + 1) \times 2(k + 1)$ matrix. Moreover, since
the difficulty of solving a matrix equation increases progressively
as the number of terms increases, Equations [7] and [8] are much
more than twice as difficult to solve as Equation [5].
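
The size comparison can be made concrete by stacking Equations [7] and [8], written for lags $0, T, \ldots, kT$, into one matrix equation. This is a sketch only; the block symbols $\Phi_{ab}$ and the column vectors $\mathbf{g}$, $\mathbf{h}$, $\boldsymbol{\phi}_{xz}$, $\boldsymbol{\phi}_{yz}$ are notation introduced here, not the author's.

```latex
\begin{bmatrix} \Phi_{xx} & \Phi_{xy} \\ \Phi_{yx} & \Phi_{yy} \end{bmatrix}
\begin{bmatrix} \mathbf{g} \\ \mathbf{h} \end{bmatrix}
=
\begin{bmatrix} \boldsymbol{\phi}_{xz} \\ \boldsymbol{\phi}_{yz} \end{bmatrix},
\qquad
(\Phi_{ab})_{in} = \phi_{ab}(iT - nT), \quad i, n = 0, \ldots, k .
```

Each block is $(k+1) \times (k+1)$, so the full matrix is $2(k+1) \times 2(k+1)$. If a direct elimination method is assumed, whose arithmetic grows roughly as the cube of the matrix order, doubling the order multiplies the work by about eight, which is consistent with the remark that the two-input problem is much more than twice as difficult.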

Equations of the form of Equations [7] and [8] are well known 
in multiple-correlation analysis and have been used in problems of 
prediction in economics and in many other fields. Wiener (5) 
has shown that it is sometimes advantageous to take the Fourier
transforms of Equations [7] and [8] to obtain solutions for the 
weighting functions in the frequency domain before transforma- 
tion back to the time domain. However, for this investigation, a 
less cumbersome technique was believed to be desirable. 

A simultaneous solution of Equations [7] and [8] can be obtained
by use of two DLS’s. One DLS is used for the $g_n$ and the
other for the $h_n$. This process requires switching back and forth
between two sets of inputs to get two outputs that most nearly
approximate $\phi_{xz}$ and $\phi_{yz}$, respectively. If four DLS’s are available,
and if their coefficients can be coupled to give two pairs of
DLS’s so that the two DLS’s in each pair have the same coefficient 
settings, then the problem can be solved directly. This procedure 
is illustrated in Fig. 4. But to avoid the use of so much equip- 
ment, it is desirable to have a means for solving for one weighting 
function independently of the other. 

First, it may be noted that if $x$ and $y$ are uncorrelated, that is, if
$\phi_{xy}(\tau) = \phi_{yx}(\tau) = 0$ for all $\tau$, then Equations [7] and [8] reduce
to equations of the same form as Equation [5], and may be solved
separately. This suggests that if $x$ and $y$ are correlated, a transformation
of variables should be sought that isolates the component
$w(t)$ of $y(t)$ that is uncorrelated with $x(t)$.

Fig. 4  DLS for Weighting Functions of Two-Input System (outputs to be matched by adjusting DLS’s)

If the relationship between $x$ and $y$ is linear, $y$ may be written
in the form

$$y(t) = w(t) + \sum_{n=-k}^{k} f_n\, x(t - nT) \qquad [9]$$

where the $f_n$ are the time-sequence representation of a linear
weighting function relating $x(t)$ and $y(t)$. The summation extends
from $-k$ to $+k$ in order to include the effect of the past of
$x(t)$ on the future of $y(t)$ (positive $n$) as well as the effect of the
past of $y(t)$ on the future of $x(t)$ (negative $n$). In the summation,
$k$ should be chosen large enough to include all the significant dynamical
effects in the physical system; that is, $k$ should be large
enough so that

$$\overline{y(t)\, x(t + nT)} \approx 0, \qquad |n| > k$$

The problem now is to find the $f_n$ that make $w(t)$ uncorrelated with
$x(t)$.

When both sides of Equation [9] are correlated with $x(t)$

$$\phi_{xy}(\tau) = \phi_{xw}(\tau) + \sum_{n=-k}^{k} f_n\, \phi_{xx}(\tau - nT)$$

Thus, by defining the $f_n$ by the relation

$$\phi_{xy}(\tau) = \sum_{n=-k}^{k} f_n\, \phi_{xx}(\tau - nT) \qquad [10]$$

the condition that $\phi_{xw}(\tau) = 0$ is fulfilled.

When the curves $\phi_{xx}$ and $\phi_{xy}$ are set up on a function generator,
Equation [10] can be solved for the $f_n$ on a DLS. The DLS is
then a dynamical model of the weighting function relating $x(t)$
and $y(t)$. “Negative” time delays are obtained merely by shifting
the time axis. With the DLS set up in this way, $x(t)$ can be fed
into the DLS, and the output, when subtracted from $y(t)$, gives
$w(t)$.

This process of finding the component of $y(t)$ that is uncorrelated
with $x(t)$ is essentially a process of orthogonalization. The
statement that $w(t)$ is uncorrelated with $x(t)$ is equivalent to
the statement that $w(t)$ is orthogonal to all the members of the time
sequence $x(t + nT)$, in the sense that $\overline{w(t)\, x(t + nT)} = 0$ for all $n$.
This use of the word “orthogonal” is an extension of its use in
connection with vectors: two vectors are orthogonal when their
scalar product is zero.

The weighting function $f(t)$ may be thought of as representing
an equivalent causal path relating $x$ and $y$, Fig. 5. In the actual
system, it is reasonable to assume that a nonzero correlation between
$x$ and $y$ is an indication of a causal relationship. By using
both positive and negative values of $n$, all linear causal relationships
between $x$ and $y$ can be represented mathematically by Fig.
5, even though the physical relationship may be more complex.
When the relationship between $x$ and $y$ is nonlinear, Equation
[10] gives the best linear representation (in a mean-square sense)
of the nonlinear relationship; this is shown later in the section on
nonlinear systems. For the remainder of this section, this relationship
is assumed to be linear.

Fig. 5  Relation Between Inputs in Two-Input System

To simplify the discussion in the remainder of this section, a
further assumption is made, that all inputs are statistically stationary;
that is, that their statistical properties such as autocorrelations
and cross-correlations do not change with time. This
assumption permits the time origin to be shifted without affecting
the value of the autocorrelation or cross-correlation function.
For example

$$\overline{x(t)\, x(t + nT)} = \overline{x(t + \tau)\, x(t + \tau + nT)} = \phi_{xx}(nT)$$

The modifications necessary when dealing with nonstationary
inputs are discussed in Appendix 2.

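Use of Equation [9], in view of the fact that $w$ is uncorrelated
with $x$, yields Equations [11] through [13]. In these equations, $n$
is merely a dummy variable of summation; any other letter can
be used equally well in its place. Hence, in a double summation,
it is convenient to distinguish between the two summations by replacing
$n$ by $m$ in one of them

$$\phi_{yx}(\tau) = \sum_{m=-k}^{k} f_m\, \phi_{xx}(\tau + mT) \qquad [11]$$

$$\begin{aligned}
\phi_{yy}(\tau) &= \phi_{ww}(\tau) + \sum_{m=-k}^{k} \sum_{n=-k}^{k} f_m f_n\, \phi_{xx}(\tau + mT - nT) \\
&= \phi_{ww}(\tau) + \sum_{m=-k}^{k} f_m\, \phi_{xy}(\tau + mT) \qquad [12]
\end{aligned}$$

$$\phi_{yz}(\tau) = \phi_{wz}(\tau) + \sum_{n=-k}^{k} f_n\, \phi_{xz}(\tau + nT) \qquad [13]$$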


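When Equations [11] and [12] are substituted in Equation [8]

$$\phi_{yz}(\tau) = \sum_{n=0}^{k} \Big\{ g_n \Big[ \sum_{m=-k}^{k} f_m\, \phi_{xx}(\tau + mT - nT) \Big] + h_n \Big[ \phi_{ww}(\tau - nT) + \sum_{m=-k}^{k} f_m\, \phi_{xy}(\tau + mT - nT) \Big] \Big\} \qquad [14]$$

When Equation [7] is substituted in Equation [13]

$$\phi_{yz}(\tau) = \phi_{wz}(\tau) + \sum_{m=-k}^{k} f_m \sum_{n=0}^{k} [\,g_n\, \phi_{xx}(\tau + mT - nT) + h_n\, \phi_{xy}(\tau + mT - nT)\,] \qquad [15]$$

Comparison of Equation [14] with Equation [15] shows that

$$\phi_{wz}(\tau) = \sum_{n=0}^{k} h_n\, \phi_{ww}(\tau - nT) \qquad [16]$$

Consequently, Equation [16] is an equation that can be solved
directly for the weighting function $h_n$. This equation can be
solved conveniently on the DLS. By interchanging the roles of $x$
and $y$ in the derivation, a similar equation can be found that can
be solved directly for $g_n$. These weighting functions then can
be checked by use of Equations [7] and [8].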

To use Equation [16], $\phi_{ww}(\tau)$ and $\phi_{wz}(\tau)$ first must be obtained
from Equations [12] and [13], respectively. These latter equations
can be solved readily for the required functions by use of the
DLS when it is set up to give the weighting function $f_n$ as defined
by Equation [10]. A comparison of Equation [10] with Equation
[12] shows that the summation on the right-hand side of
Equation [12] would be obtained as the DLS output by using
$\phi_{xy}(\tau)$ as the input if there were a minus sign instead of a plus
sign inside the parentheses. Since the sign is plus, $\phi_{xy}(-\tau)$ is
used as the input to the DLS, and the output is then
$\phi_{yy}(-\tau) - \phi_{ww}(-\tau)$. The resulting curve may be traced and then
subtracted graphically from $\phi_{yy}(-\tau)$ to give $\phi_{ww}(-\tau)$, or the
subtraction may be performed electronically, by use of analog-computer
elements. In the same way, $\phi_{wz}(-\tau)$ can be found by
using Equation [13].

Fig. 6  Summary of Method for Determination of One Weighting
Function of Two-Input System
(a) Determination of $\phi_{xx}$, $\phi_{xy}$, $\phi_{yy}$, $\phi_{xz}$, and $\phi_{yz}$
(b) Determination of $f_n$
(c) Determination of $\phi_{ww}$
(d) Determination of $\phi_{wz}$
(e) Determination of $h_n$

To summarize, as shown in Figs. 6(a-e), the steps in finding the
weighting function of one input in a system with two inputs $x$ and
$y$ and a single output $z$ are as follows:


1 Compute $\phi_{xx}$, $\phi_{xy}$, $\phi_{yy}$, $\phi_{xz}$, and $\phi_{yz}$.

2 Using $\phi_{xx}$ as DLS input, adjust the DLS coefficients so that
the output is $\phi_{xy}$. In order to match this output it may be
necessary to shift its $\tau$-axis so that the DLS gives, in effect, time
advances as well as time delays. The DLS, when allowance is
made for the shift of the $\tau$-axis, is now set up to represent the
weighting function $f_n$, defined by Equation [10].

3 With the same DLS settings, apply $\phi_{xy}(-\tau)$ as input to the
DLS and subtract the resulting output from $\phi_{yy}(-\tau)$. This
gives $\phi_{ww}(-\tau)$ (see Equation [12]).

4 With the same DLS settings, apply $\phi_{xz}(-\tau)$ as input to the
DLS, and subtract the resulting output from $\phi_{yz}(-\tau)$. This
gives $\phi_{wz}(-\tau)$ (see Equation [13]).

5 Using $\phi_{ww}(\tau)$ as DLS input, adjust the DLS coefficients so
that the output is $\phi_{wz}(\tau)$. The DLS is now set up to represent
the weighting function $h_n$, and Equation [16] has been solved.
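
The five steps can be followed through numerically, with linear-equation solving standing in for the manual adjustment of DLS coefficients. The sketch below is an illustration under stated assumptions, not the author's procedure: lags are in units of T, the correlation functions of step 1 are generated analytically from hypothetical sequences F, G, H and a hypothetical exponential autocorrelation for x, and w is taken as white noise uncorrelated with x.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


# Hypothetical "true" system, used only to generate consistent correlations.
F = {0: 0.8, 1: 0.4}   # f_n relating x to y (Equation [9])
G = {0: 1.0, 1: 0.5}   # g_n, weighting sequence of the x input
H = {0: 0.7, 1: 0.2}   # h_n, weighting sequence of the y input


# Step 1: the five correlation functions (analytic here, measured in practice).
def phi_xx(lag):
    return 0.5 ** abs(lag)

def phi_xy(lag):
    return sum(f * phi_xx(lag - n) for n, f in F.items())

def phi_yy(lag):
    white = 1.0 if lag == 0 else 0.0          # assumed phi_ww of the true w
    return white + sum(fm * fn * phi_xx(lag + m - n)
                       for m, fm in F.items() for n, fn in F.items())

def phi_xz(lag):
    return sum(G[n] * phi_xx(lag - n) + H[n] * phi_xy(lag - n) for n in G)

def phi_yz(lag):
    # phi_yx(s) equals phi_xy(-s).
    return sum(G[n] * phi_xy(n - lag) + H[n] * phi_yy(lag - n) for n in G)


K = 2
lags = range(-K, K + 1)

# Step 2: solve Equation [10] for f_n, n = -K .. K.
f_values = solve_linear([[phi_xx(tau - n) for n in lags] for tau in lags],
                        [phi_xy(tau) for tau in lags])
f_est = dict(zip(lags, f_values))

# Steps 3 and 4: Equations [12] and [13] rearranged for phi_ww and phi_wz.
def phi_ww(lag):
    return phi_yy(lag) - sum(f * phi_xy(lag + n) for n, f in f_est.items())

def phi_wz(lag):
    return phi_yz(lag) - sum(f * phi_xz(lag + n) for n, f in f_est.items())

# Step 5: solve Equation [16] for h_n, n = 0 .. K.
h_est = solve_linear([[phi_ww(i - n) for n in range(K + 1)] for i in range(K + 1)],
                     [phi_wz(i) for i in range(K + 1)])
print(f_est, h_est)
```

Because the correlations are generated from the same model that the steps invert, the assumed f_n and h_n are recovered to rounding error. With correlations computed from finite records, the repeated subtractions in steps 3 and 4 would amplify estimation errors, so the result would be only approximate.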


SYSTEMS WITH MORE THAN TWO INPUTS


Essentially, what was done in the foregoing section was to
“isolate” the weighting function $h_n$ by “isolating” the component
of $y(t)$ that is uncorrelated with $x(t)$. This suggests that if there
are more than two inputs, the component of one input that is uncorrelated
with all the others should be isolated.


Fig. 7  Three-Input, Single-Output System (inputs x(t), y(t), and v(t); output z(t))


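To illustrate the procedure when there are more than two inputs,
a three-input system is shown in Fig. 7. Now

$$z(t) = \sum_{n=0}^{k} [\,g_n\, x(t - nT) + h_n\, y(t - nT) + q_n\, v(t - nT)\,] \qquad [17]$$

When both sides of Equation [17] are correlated with $x$, $y$, and $v$,
respectively

$$\phi_{xz}(\tau) = \sum_{n=0}^{k} [\,g_n\, \phi_{xx}(\tau - nT) + h_n\, \phi_{xy}(\tau - nT) + q_n\, \phi_{xv}(\tau - nT)\,] \qquad [18]$$

$$\phi_{yz}(\tau) = \sum_{n=0}^{k} [\,g_n\, \phi_{yx}(\tau - nT) + h_n\, \phi_{yy}(\tau - nT) + q_n\, \phi_{yv}(\tau - nT)\,] \qquad [19]$$

$$\phi_{vz}(\tau) = \sum_{n=0}^{k} [\,g_n\, \phi_{vx}(\tau - nT) + h_n\, \phi_{vy}(\tau - nT) + q_n\, \phi_{vv}(\tau - nT)\,] \qquad [20]$$

The component of $v$ which is orthogonal to $x$ and $y$ may be denoted
by $u$. To isolate this component, $v$ may be written in the
form

$$v(t) = u(t) + \sum_{n=-k}^{k} [\,b_n\, x(t - nT) + c_n\, w(t - nT)\,] \qquad [21]$$

with the result that

$$\phi_{xv}(\tau) = \sum_{n=-k}^{k} b_n\, \phi_{xx}(\tau - nT) \qquad [22]$$

and

$$\phi_{wv}(\tau) = \sum_{n=-k}^{k} c_n\, \phi_{ww}(\tau - nT) \qquad [23]$$

Equations [22] and [23] can be solved on the DLS in the same
manner as Equation [10], and give the weighting functions $b_n$ and
$c_n$ when $\phi_{wv}(\tau)$ is obtained from

$$\phi_{yv}(\tau) = \phi_{wv}(\tau) + \sum_{n=-k}^{k} f_n\, \phi_{xv}(\tau + nT) \qquad [24]$$

in the same way that $\phi_{ww}(\tau)$ is obtained from Equation [12].
Other correlation functions that are needed are

$$\phi_{vv}(\tau) = \phi_{uu}(\tau) + \sum_{n=-k}^{k} [\,b_n\, \phi_{xv}(\tau + nT) + c_n\, \phi_{wv}(\tau + nT)\,] \qquad [25]$$

$$\phi_{vz}(\tau) = \phi_{uz}(\tau) + \sum_{n=-k}^{k} [\,b_n\, \phi_{xz}(\tau + nT) + c_n\, \phi_{wz}(\tau + nT)\,] \qquad [26]$$

From Equations [18] through [26] it follows that

$$\phi_{uz}(\tau) = \sum_{n=0}^{k} q_n\, \phi_{uu}(\tau - nT) \qquad [27]$$

Thus a means is achieved for solving for the weighting function $q_n$
directly on the DLS when $\phi_{uu}(\tau)$ and $\phi_{uz}(\tau)$ have been found from
Equations [25] and [26], respectively. These equations can be
solved by use of the DLS in the same way as for Equations [12]
and [13].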

This procedure may be extended to any number of input variables;
for example, if there are four inputs, the first step is to
find the weighting functions relating the fourth input to $x$, $w$, and
$u$. As the number of variables is increased, however, the amount
of work involved increases out of all proportion to the increase in 
the number of variables. Also, if the correlations are obtained 
experimentally, they must have high initial accuracy in order to 
survive a number of subtractions with meaningful results. 


SYSTEMS WITH MORE THAN ONE OUTPUT


A system with more than one output introduces no further complications
because each output can be treated independently of all
the others. Thus a system with n outputs can be treated, for 
purposes of determining weighting functions, as n independent 
systems, all having the same inputs. 


EFFECT OF NEGLECTING AN INPUT


The results of the preceding discussion may be applied to show
the effect of neglecting one input in the determination of the
weighting functions of a system with two or more inputs. As an
example, the effect of neglecting $y(t)$ in the system of Fig. 3 may
be considered.

It is clear that if $x(t)$ and $y(t)$ are uncorrelated, the weighting
function $g(t)$ can be determined from $x(t)$ and $z(t)$ alone, and no
error can result from neglecting $y(t)$. Thus, if $y(t)$ is a noise disturbance
uncorrelated with $x(t)$, it can have no effect on the determination
of $g(t)$. However, if $x(t)$ and $y(t)$ are correlated, the
weighting function that would be found from $x(t)$ and $z(t)$ alone,
when $y(t)$ is neglected, would not be $g(t)$. Instead, it would be
$g(t)$ plus the cascaded effect of $f(t)$ and $h(t)$. In terms of the
equivalent causal path of Fig. 5, the weighting function found
would be not the response for a unit impulse at the $x$-input to the
system but rather the response for a unit impulse in $x(t)$ applied
“upstream” of the equivalent causal path joining $x(t)$ and $y(t)$.
This result can be generalized to systems with any number of inputs.
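
The statement that the apparent weighting function is $g(t)$ plus the cascade of $f(t)$ and $h(t)$ can be traced through the discrete equations. As a sketch, Equation [10] is substituted into Equation [7] and terms are collected with $p = n + m$ (an index introduced here; coefficients outside the defined ranges of $g_n$, $h_n$, and $f_m$ are taken as zero):

```latex
\phi_{xz}(\tau)
= \sum_{n} g_n\,\phi_{xx}(\tau - nT)
+ \sum_{n} h_n \sum_{m} f_m\,\phi_{xx}(\tau - nT - mT)
= \sum_{p} \Big[\, g_p + \sum_{m} f_m\, h_{p-m} \Big]\,\phi_{xx}(\tau - pT).
```

Comparison with Equation [5] suggests that a single-input solution using $x$ and $z$ alone would return the bracketed sequence, that is, $g_p$ plus the discrete convolution of $f$ and $h$, rather than $g_p$ itself.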

NONLINEAR SYSTEMS


Since the weighting function is a linear representation of a
system, the application of the techniques described in the foregoing
section to a system that is in fact nonlinear still gives a
linear representation. In this section it will be shown that this
linear representation is the best linear representation of the system
(in a mean-square sense) obtainable from the available data. To
indicate the range of application of this linear representation in
control problems, reference then will be made to a number of recent
control-system studies using linear representations of nonlinear
systems.

Since the equations of this paper are written for continuous in- 
put and output functions but for discrete weighting functions, two 
treatments of the problem are given. In the first treatment, given 
in this section, input and output functions as well as weighting 
functions are considered only at discrete points in time; this 
treatment is based on “least squares.” In the second treatment,
given in Appendix 1, weighting functions as well as input and 
output functions are treated as continuous; the solution then can 
be specialized to discrete points in time to give the equations used 
earlier. Both treatments are similar to the corresponding treat- 
ments which have been used to demonstrate methods for com- 
puting optimum linear filters (5). Throughout the discussion the 
assumption is made that the systems themselves are time in- 
variant and that their inputs are statistically stationary; the 
implications of nonstationary inputs are discussed in Appendix 2. 

Single-Input, Single-Output System. In the system represented 
by Fig. 1 the supposition is made that the system is now non- 
linear, but that it is desired to represent c(t) as closely as possible 
by a series of the form

$$\sum_{n=0}^{k} h_n\, m(t - nT)$$

As the standard of “as closely as possible,” the condition is used
that the mean-square error of approximation

$$M = \overline{\Bigl[\,c(t) - \sum_{n=0}^{k} h_n\, m(t - nT)\Bigr]^2} \tag{28}$$


should be a minimum, where the bar denotes an average taken 
over all available data. 

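To choose the $h_n$ so as to make M a minimum, a necessary
condition is that

$$\frac{\partial M}{\partial h_i} = 0, \qquad i = 0, \ldots, k \tag{29}$$

Now

$$M = \overline{c^2(t)} - 2\sum_{n} h_n\, \overline{m(t - nT)\,c(t)} + \sum_{n}\sum_{r} h_n h_r\, \overline{m(t - nT)\,m(t - rT)}$$

hence

$$\frac{\partial M}{\partial h_i} = -2\,\overline{m(t - iT)\,c(t)} + 2\sum_{n} h_n\, \overline{m(t - iT)\,m(t - nT)}$$

$$= -2\,\phi_{mc}(iT) + 2\sum_{n} h_n\, \phi_{mm}(iT - nT)$$

Thus Equation [29] requires that

$$\phi_{mc}(iT) = \sum_{n=0}^{k} h_n\, \phi_{mm}(iT - nT), \qquad i = 0, \ldots, k \tag{30}$$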


To show that Equation [30] actually makes M a minimum
(rather than a maximum or a value that is neither maximum nor
minimum), any small change in the $h_n$ must be shown to result in
a larger M. Therefore the effect of replacing the $h_n$ of Equation
[30] by ($h_n + \Delta h_n$) is investigated. Equation [28] then becomes




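$$M + \Delta M = \overline{c^2(t)} - 2\sum_{n} (h_n + \Delta h_n)\, \overline{m(t - nT)\,c(t)} + \sum_{n}\sum_{r} (h_n + \Delta h_n)(h_r + \Delta h_r)\, \overline{m(t - nT)\,m(t - rT)}$$

with the result

$$\Delta M = -2\sum_{n} \Delta h_n\, \overline{m(t - nT)\,c(t)} + \sum_{n}\sum_{r} (h_n\,\Delta h_r + h_r\,\Delta h_n + \Delta h_n\,\Delta h_r)\, \overline{m(t - nT)\,m(t - rT)}$$

When the $h_n$ are chosen to satisfy Equation [30], $\Delta M$ reduces to

$$\Delta M = \sum_{n}\sum_{r} \Delta h_n\,\Delta h_r\, \overline{m(t - nT)\,m(t - rT)} = \overline{\Bigl[\sum_{n} \Delta h_n\, m(t - nT)\Bigr]\Bigl[\sum_{r} \Delta h_r\, m(t - rT)\Bigr]} = \overline{\Bigl[\sum_{n} \Delta h_n\, m(t - nT)\Bigr]^2}$$

which is positive for all possible values of $\Delta h_n$ as long as there is
a nonzero input m(t). Thus, since any change in the $h_n$ from
the values given by Equation [30] results in an increase in M, these
values do make M a minimum. In the course of this proof the
subscripts n and r have been interchanged freely since they are
merely dummy subscripts of summation.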

A comparison of Equation [30] with Equation [5] shows that
while for a linear system Equation [5] should be fulfilled for all
points of time, for a nonlinear system, Equation [5] should be
fulfilled only at the values of $\tau$ corresponding to the values of t for
which the weighting function is sought. This gives a (k + 1) by
(k + 1) matrix equation which could be solved for the $h_n$ by
conventional methods. If the weighting function of a nonlinear
system is found by using the DLS, then Equation [5] should be
satisfied for the range

$$0 \le \tau \le kT$$

because the discrete points in Equation [29] are generally close
enough together so that their individual identity need not be
preserved.
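
The (k + 1) by (k + 1) matrix equation can be set up and solved directly on sampled records. The following is a minimal sketch, not the DLS procedure of the paper: it assumes unit sampling interval, forms every average over the same set of sample times (so that the solution is exactly the least-squares fit for the record at hand, without relying on stationarity), and uses a hypothetical mildly nonlinear system with a cubic term. All function names and numerical values are illustrative assumptions.

```python
import random


def average_product(a, b, lag_a, lag_b, start):
    """Average of a[t - lag_a] * b[t - lag_b] over t = start .. len(a) - 1."""
    count = len(a) - start
    return sum(a[t - lag_a] * b[t - lag_b] for t in range(start, len(a))) / count


def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def best_linear_weights(m, c, k):
    """Solve phi_mc(i) = sum_n h_n phi_mm(i - n), i = 0..k, for h_0..h_k."""
    phi_mm = [[average_product(m, m, i, n, k) for n in range(k + 1)]
              for i in range(k + 1)]
    phi_mc = [average_product(m, c, i, 0, k) for i in range(k + 1)]
    return solve_linear(phi_mm, phi_mc)


def mean_square_error(m, c, h, start):
    """Average of [c(t) - sum_n h_n m(t - n)]^2 over t = start .. len(m) - 1."""
    errors = [c[t] - sum(hn * m[t - n] for n, hn in enumerate(h))
              for t in range(start, len(m))]
    return sum(e * e for e in errors) / len(errors)


rng = random.Random(0)
m = [rng.uniform(-1.0, 1.0) for _ in range(400)]
# Hypothetical system: three-term linear part plus a cubic nonlinearity.
c = [0.0, 0.0] + [0.5 * m[t] + 0.3 * m[t - 1] + 0.1 * m[t - 2] + 0.2 * m[t] ** 3
                  for t in range(2, len(m))]

h = best_linear_weights(m, c, k=2)
M_best = mean_square_error(m, c, h, start=2)
M_perturbed = mean_square_error(m, c, [h[0] + 0.05, h[1], h[2] - 0.05], start=2)
```

Any perturbation of the solved weights, such as the one used for `M_perturbed`, gives a larger mean-square error than `M_best`, in line with the minimum property shown above.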

The conclusions just reached should be compared with the
conclusions reached in Appendix 1 of reference (3). There it was
shown that when there is noise in a closed-loop system, Equation
[5] should be satisfied in a region, $\tau \ge \Delta$, far enough to the right
of the origin to eliminate the effect of the noise. Thus, in
obtaining the weighting function of a nonlinear closed-loop system with
noise, the engineer must use judgment in compromising between
satisfying Equation [5] in the region of $\tau$ that gives the optimum
linear representation of the system and satisfying Equation [5] in
the region that minimizes the effects of noise in the closed loop.

Two-Input, Single-Output System. By application of the same 
procedure to the system represented by Fig. 3, the function to be 
minimized is now 


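$$M = \overline{\Bigl[\,z(t) - \sum_{n} g_n\, x(t - nT) - \sum_{n} h_n\, y(t - nT)\Bigr]^2} \tag{31}$$

The choice of $g_n$ and $h_n$ to achieve this result requires that

$$\frac{\partial M}{\partial g_i} = 0 \quad \text{and} \quad \frac{\partial M}{\partial h_i} = 0 \tag{32}$$

Equating to zero the derivatives of M with respect to the $g_i$ yields

$$\phi_{xz}(iT) = \sum_{n} \bigl[\,g_n\, \phi_{xx}(iT - nT) + h_n\, \phi_{xy}(iT - nT)\bigr] \tag{33}$$

and equating to zero the derivatives of M with respect to the $h_i$
yields

$$\phi_{yz}(iT) = \sum_{n} \bigl[\,g_n\, \phi_{yx}(iT - nT) + h_n\, \phi_{yy}(iT - nT)\bigr] \tag{34}$$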


To show that the values of $g_n$ and $h_n$ given by Equations [33] and
[34] actually do make M a minimum, the same procedure used
for Equation [28] can be followed.

Comparison of Equations [33] and [34] with Equations [7] and
[8] reveals that the optimum linear representation of a two-input
nonlinear system is obtained when Equations [7] and [8] are
satisfied for the values of $\tau$ corresponding to the values of t for
which the weighting function is sought.

By an extension of the reasoning of Appendix 1 of reference (3),
it can be shown (6) that when there is noise in a closed-loop,
two-input system, Equations [7] and [8] should be satisfied in a
region, $\tau \ge \Delta$, far enough to the right of the origin to eliminate the
effects of the noise. Thus, as in the case of a single-input system,
a compromise must be made between satisfying these equations in
the region of $\tau$ that gives the optimum linear representation and
satisfying them in the region that minimizes the effects of noise.

The results of this section can be generalized to any number of 
inputs. 

Relation Between Two Inputs. By the same type of argument,
it can be shown that when two inputs x and y are not linearly
related, the best linear time-series representation of y in terms of x
(in a mean-square sense) is

$$\sum_{n=-k}^{k} f_n\, x(t - nT)$$

where the $f_n$ are obtained by solving Equation [10] at the values
of $\tau$ corresponding to the values of t for which the $f_n$ are sought.
To show this, the expression

$$M = \overline{\Bigl[\,y(t) - \sum_{n=-k}^{k} f_n\, x(t - nT)\Bigr]^2}$$

is minimized by setting

$$\frac{\partial M}{\partial f_i} = 0$$

By expansion of M as before and by setting the derivatives equal
to zero

$$\phi_{xy}(iT) = \sum_{n=-k}^{k} f_n\, \phi_{xx}(iT - nT), \qquad i = -k, \ldots, k \tag{35}$$

and by the procedure used previously, it can be shown that these
values of the $f_n$ actually do make M a minimum.

Use of Linear Representations in Control Problems. While a 
linear representation admittedly does not give a complete de- 
scription of a nonlinear system, it nevertheless gives a description 
that is often useful for control purposes. For many types of non- 
linearities, a system becomes essentially linear when the input 
variations are infinitesimal, and essential information about the 
performance and stability of the system often can be determined 
from the linear model based on small perturbations (7). In 
records made under normal operating conditions, the random 
variations in inputs and outputs are likely to be so small that the 
weighting function determined from the normal operating 
records is the same as that given by small-perturbation theory.

When the random variations in normal operating records are so 
large that the essentially linear range of operation of the system is 
exceeded, a weighting function discovered from normal operating 
records may be compared to a “quasi linearization” of the system.
Quasi linearization (8 to 11) is the replacement of each nonlinear
element in the system by an equivalent gain which is a function of
the rms amplitude of the input; for each input-amplitude level, 
the equivalent gain is chosen to give the best approximation, in a 
mean-square sense, to the actual system output. When the input 
is sinusoidal, the quasi-linear representation of the output turns 
out to be its fundamental Fourier coefficient, and hence, quasi 
linearization is equivalent to discarding higher harmonics (8, 9). 
When the input is a random signal with a known probability dis- 
tribution, the equivalent gain can be calculated from autocorrela- 
tions and cross-correlations (10) by use of a method similar to the 
method given earlier in this section. Both the quasi linearization 
based on sinusoidal inputs (8, 9, 11) and that based on random in- 
puts (10) have proved useful in practice for systems that do not 
depart drastically from linearity; the approach based on random 
inputs has the advantage that random inputs are more realistic 
than sinusoidal inputs in an actual control situation. 
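
Both kinds of equivalent gain can be illustrated for a single amplitude-dependent element. The sketch below is an illustration under stated assumptions, not the procedure of references (8) to (11): the sinusoidal gain is computed as the in-phase fundamental Fourier coefficient of the output divided by the input amplitude, and the random-input gain as the value K that minimizes the average of [f(x) - Kx]^2, which follows from the same least-squares argument used earlier in this section. The relay and saturation elements and all names are hypothetical examples.

```python
import math


def sinusoidal_gain(nonlinearity, amplitude, samples=20000):
    """In-phase fundamental Fourier coefficient of the output per unit input amplitude."""
    total = 0.0
    for i in range(samples):
        angle = 2.0 * math.pi * (i + 0.5) / samples
        total += nonlinearity(amplitude * math.sin(angle)) * math.sin(angle)
    return (2.0 / samples) * total / amplitude


def mean_square_gain(nonlinearity, inputs):
    """Gain K minimizing the average of (nonlinearity(x) - K * x)**2 over the inputs."""
    numerator = sum(x * nonlinearity(x) for x in inputs)
    denominator = sum(x * x for x in inputs)
    return numerator / denominator


def relay(x, level=1.0):
    """Ideal relay (assumed example element)."""
    return level if x >= 0 else -level


def saturation(x, limit=1.0):
    """Unit-slope element that saturates at +/- limit (assumed example element)."""
    return max(-limit, min(limit, x))


relay_gain = sinusoidal_gain(relay, amplitude=2.0)        # close to 4 / (pi * 2)
small_signal_gain = sinusoidal_gain(saturation, 0.5)      # input stays in linear range
large_signal_gain = sinusoidal_gain(saturation, 4.0)      # input saturates heavily
```

The gain of the saturating element falls as the input amplitude rises, which is the amplitude dependence that a weighting function taken from records at a single signal level cannot reveal.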

The weighting functions determined from normal operating 
records correspond to the weighting functions for a quasi-linear- 
ized system with input-amplitude level equal to the actual level of 
the normal operating input. The use of normal operating records 
has the advantage that the weighting function is based on an 
averaging of data taken under actual operating conditions and 
hence is more realistic than a weighting function based on arbi- 
trary input signals. A disadvantage of using normal operating 
records is that the data are taken only for the input-signal level 
occurring in the system; thus no information is obtained on the 
variation of quasi-linearized system parameters with signal level. 
This disadvantage sometimes can be offset if the system inputs 
have different signal levels under different operating conditions. 

Much work remains to be done on the applicability of linearized 
representations to nonlinear control problems. The work just 
cited, however, indicates that linearized representations contain 
much useful information, especially when interpreted in the light 
of the known physical characteristics of the system. 

To obtain further information about a nonlinear system from its 
normal operating records, the weighting function obtained by 
correlation techniques may be treated as a first approximation. 
The output for the linear approximation can be compared with the 
actual system output, and the difference between these two out- 
puts can be used to determine the effects of nonlinearities under 
operating conditions. This procedure, suggested by Tustin (12), 
can be accomplished readily on the DLS. The weighting function 
is set up on the DLS, and the original system input is played 
through it; the DLS output then is subtracted from the original 
system output to give the portion of the output that is due to non- 
linear effects. 
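
The subtraction step can be sketched in a few lines. This is only an illustration of the idea, with the DLS replaced by a software tapped delay line; the weights, the input record, and the cubic term are hypothetical values chosen so that the nonlinear part is known in advance.

```python
def linear_model_output(weights, inputs):
    """Play the input through the linear model: sum_n weights[n] * inputs[t - n]."""
    return [sum(w * inputs[t - n] for n, w in enumerate(weights) if t - n >= 0)
            for t in range(len(inputs))]


def nonlinear_residual(weights, inputs, outputs):
    """Recorded output minus linear-model output, sample by sample."""
    model = linear_model_output(weights, inputs)
    return [actual - predicted for actual, predicted in zip(outputs, model)]


# Hypothetical record: a two-term linear part plus a small cubic term.
weights = [0.5, 0.25]
inputs = [0.0, 1.0, 2.0, -1.0, 0.5, -2.0]
outputs = [lin + 0.1 * x ** 3
           for lin, x in zip(linear_model_output(weights, inputs), inputs)]

residual = nonlinear_residual(weights, inputs, outputs)
```

Here `residual` recovers the cubic term because the assumed weights equal the true linear part; with weights obtained by correlation from real records, the residual would also contain whatever part of the nonlinearity the best linear fit has absorbed.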


FREQUENCY RESPONSE 


The frequency-response function, giving the attenuation and 
phase shift of a system for sinusoidal inputs of different frequen- 
cies, is often of interest. When written in complex-variable nota- 
tion, the frequency-response function is the Fourier transform of 
the weighting function (1). 

Three methods of obtaining the frequency-response function 
from normal operating records are: 


1 Use the DLS to solve for the weighting function (Fig. 6); 
then experimentally measure the frequency response of the DLS, 
which serves as a model of the system. This method has been 
found to be simple and effective in practice and is the recom- 
mended method. 

2 Obtain the weighting function by using the DLS or by 
numerical means; then numerically perform a Fourier transfor- 
mation to obtain the frequency-response function. This method 
is not recommended if the weighting function is obtained on the 
DLS, because it does not take full advantage of the DLS. 


3 Use a frequency analysis throughout. Instead of using 
correlation functions, their Fourier transforms, the spectral 
densities, may be used (1). Some simplification results from the 
fact that the Fourier transform of a convolution integral is a sim- 
ple multiplication. Thus, by taking the Fourier transform of both 
sides of Equation [40] (see Appendix 1), which is the continuous 
form of Equation [5] 


$$\Phi_{mc}(\omega) = H(j\omega)\,\Phi_{mm}(\omega) \tag{36}$$

from which the frequency-response function $H(j\omega)$ can be
obtained merely by dividing the cross spectral density $\Phi_{mc}(\omega)$ by
the input spectral density $\Phi_{mm}(\omega)$. This method appears
attractive because of its evident mathematical simplicity;
however, it has two pitfalls: (a) Since the frequency-response
function does not show the directions of causal paths in a closed loop,
there is no way of eliminating the effect of noise in a closed
loop (see Appendix 1 of reference 3). (b) The process of taking the
Fourier transform of the correlations is usually difficult in prac- 
tice, especially if the correlations are based on short records of 
inputs and outputs. If, in spite of these difficulties, it is desired 
to use equations of the form of Equation [36], the frequency- 
response functions of a multi-input system also can be obtained by 
this means. The equations used are merely the Fourier transforms 
of Equations [7], [8], and [10] through [16] for the two-input case, 
and Equations [18] through [20] and [22] through [27] for the 
three-input case. 

These Fourier transforms are valid only for statistically station- 
ary inputs. 
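
The numerical Fourier transformation of method 2 can be sketched as follows. The sketch assumes the discrete series form used in this paper, in which the output is the sum of $h_n m(t - nT)$, so that the frequency response is the sum of $h_n e^{-j\omega nT}$; the function names and the example weights are illustrative assumptions.

```python
import cmath
import math


def frequency_response(weights, omega, interval):
    """H(j*omega) = sum_n h_n * exp(-j * omega * n * T) for a discrete weighting function."""
    return sum(h_n * cmath.exp(-1j * omega * n * interval)
               for n, h_n in enumerate(weights))


def attenuation_and_phase(weights, omega, interval):
    """Magnitude and phase shift (radians) of the response at angular frequency omega."""
    value = frequency_response(weights, omega, interval)
    return abs(value), cmath.phase(value)


# Hypothetical weighting function: a two-point moving average with T = 1.
weights = [0.5, 0.5]
dc_gain, dc_phase = attenuation_and_phase(weights, 0.0, 1.0)
half_gain, half_phase = attenuation_and_phase(weights, math.pi / 2.0, 1.0)
```

For this example the gain is unity at zero frequency and falls toward zero as the frequency approaches pi/T, where the two ordinates cancel.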

CONCLUSION


The method presented here should facilitate the determination 
of weighting functions of systems with multiple, mutually 
correlated inputs from normal operating records. For a nonlinear 
system, the optimum linear representation given by this method 
supplies useful information for control purposes, and may be used 
as a first step toward a more refined description of the system. 


ACKNOWLEDGMENT 


The benefit of discussions of this material with Prof. J. B. 
Reswick of Massachusetts Institute of Technology, and with Mr. 
J. M. Loeb of the Schlumberger Instrument Company, is greatly 
appreciated. 

BIBLIOGRAPHY 


1 “Theory of Servomechanisms,” by H. M. James, N. B.
Nichols, and R. S. Phillips, McGraw-Hill Book Company, Inc., New
York, N. Y., 1947, chapter 2.

2 “Application of Statistical Methods to Communication Prob- 
lems,” by Y.-W. Lee, Research Laboratory of Electronics, Massa- 
chusetts Institute of Technology, Cambridge, Mass., September 1, 
1950, Technical Report No. 181. 

3 “Determination of System Characteristics From Normal 
Operating Records,” by T. P. Goodman and J. B. Reswick, Trans. 
ASME, vol. 78, 1956, pp. 259-271. 

4 “Determine System Dynamics Without Upset,” by J. B. 
Reswick, Control Engineering, vol. 2, June, 1955, pp. 50-57. 

5 “Extrapolation, Interpolation, and Smoothing of Stationary
Time Series With Engineering Applications,” by N. Wiener, The
Technology Press, Massachusetts Institute of Technology, Cambridge,
Mass., and John Wiley & Sons, Inc., New York, N. Y., 1949. (See
especially Appendix B, “The Wiener RMS (Root Mean Square) Error
Criterion in Filter Design and Prediction,” and Appendix C, “A
Heuristic Exposition of Wiener’s Mathematical Theory of Prediction
and Filtering,” by N. Levinson.)

6 “Experimental Determination of System Characteristics From
Correlation Measurements,” by T. P. Goodman, ScD thesis,
Department of Mechanical Engineering, Massachusetts Institute of
Technology, Cambridge, Mass., 1955.

7 “Study of Pneumatic Processes in the Continuous Control of
Motion With Compressed Air—Parts I and II,” by J. L. Shearer,
Trans. ASME, vol. 78, 1956, pp. 233-249.


8 “A Frequency Response Method for Analyzing and Synthesizing
Contactor Servomechanisms,” by R. J. Kochenburger, Trans.
AIEE, vol. 69, part I, 1950, pp. 270-284.

9 “Sinusoidal Analysis of Feedback-Control Systems Containing 
Nonlinear Elements,” by E. C. Johnson, Jr., Trans. AIEE, vol. 71, 
part II, 1952, pp. 169-181. 

10 “Nonlinear Control Systems With Statistical Inputs,” by R. C. 
Booton, Jr., Dynamic Analysis and Control Laboratory Report No. 
61, Massachusetts Institute of Technology, Cambridge, Mass., March 
1, 1952. 

11 “Contributions to Hydraulic Controls—7, Analysis of the
Effects of Nonlinearity in a Valve-Controlled Hydraulic Drive,” by 
E. I. Reeves, Trans. ASME, vol. 79, 1957, pp. 427-432. 

12 “The Mechanism of Economic Systems,” by A. Tustin, 
William Heinemann, Ltd., London, England, 1955 Appendix. 


Appendix 1 


OPTIMUM CONTINUOUS WEIGHTING FUNCTIONS FOR NONLINEAR
SYSTEMS

When the input and output of a system are given as continu- 
ous functions, the optimum linear weighting function may be 
found as a continuous function of time by using the calculus of 
variations. 

Equation [2] may be made continuous in the weighting
function $h_n$ by passing to the limit as $T \to 0$ and $kT \to K$, giving the
convolution integral

$$c(t) = \int_0^K h(\tau)\, m(t - \tau)\, d\tau$$


The upper limit of integration K must be large enough to include 
all of the significant portion of the weighting function; if neces- 
sary, it may be infinity. 

For a nonlinear system, it is desired to approximate c(t) as
closely as possible by a convolution integral of this form. The
problem therefore is to find the continuous function $h(\tau)$ that
minimizes the expression

$$M = \overline{\Bigl[\,c(t) - \int_0^K h(\tau)\, m(t - \tau)\, d\tau\Bigr]^2} \tag{37}$$

As before, the bar denotes an average taken over all available data.
If the assumption is made that $h(\tau)$ represents a stable system (1),
the convolution integral is finite for all bounded values of m, even
when $K \to \infty$. It is therefore permissible to interchange the
order of the operations of integration and averaging.

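Since $h(\tau)$ is now a continuous function, rather than a sequence
of parameters, the calculus of variations must be used to minimize
M. Therefore, when $h(\tau)$ is replaced by $[h(\tau) + \Delta h(\tau)]$ in
Equation [37]

$$M + \Delta M = \overline{\Bigl[\,c(t) - \int_0^K [h(\tau) + \Delta h(\tau)]\, m(t - \tau)\, d\tau\Bigr]^2} \tag{38}$$

Here $\Delta h(\tau)$ is thought of as a small increment in $h(\tau)$ that is a
continuous function of $\tau$. By expansion of Equation [38] and
subtraction of Equation [37], using u as a second dummy variable
of integration

$$\Delta M = -2\int_0^K \Delta h(\tau)\Bigl\{\phi_{mc}(\tau) - \int_0^K h(u)\,\phi_{mm}(\tau - u)\, du\Bigr\}\, d\tau + \int_0^K\!\!\int_0^K \Delta h(\tau)\,\Delta h(u)\,\phi_{mm}(\tau - u)\, du\, d\tau \tag{39}$$

A necessary condition to make M a minimum is that the sum
of the terms involving the first power of $\Delta h(\tau)$ be zero for all
possible $\Delta h(\tau)$. This condition is fulfilled if, and only if, the expression
in the braces is zero throughout the range of integration; that is,
if, and only if

$$\phi_{mc}(\tau) = \int_0^K h(u)\,\phi_{mm}(\tau - u)\, du, \qquad 0 \le \tau \le K \tag{40}$$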


With this condition fulfilled, a sufficient condition to make M
a minimum is that the sum of the terms involving the second
power of $\Delta h(\tau)$ be positive for all nonzero $\Delta h(\tau)$. Since the last
term of Equation [39] is positive for all nonzero $\Delta h(\tau)$, except for
the trivial case m(t) = 0, this condition is also fulfilled.

Comparison of Equation [40] with Equations [30] and [5]
shows again that the optimum weighting function for a nonlinear
system is obtained by matching the cross-correlation for the range
of $\tau$ for which the weighting function is sought.

This result can be generalized to any number of input variables, 
and can be readily extended to prove Equation [35] in continuous 
form (6). 


Appendix 2 


NONSTATIONARY INPUTS 


Many of the equations in this paper were simplified greatly by 
the assumption that the system inputs, and hence also the out- 
puts, are statistically stationary; that is, that their statistical 
properties, including autocorrelations and cross-correlations, do 
not change with time. Inputs encountered in practice may not be 
stationary; even if they are stationary, the length of available 
input and output records may not be sufficient to smooth out all 
the instantaneous variations, with the result that correlations 
based on short records may be nonstationary. 

Equations [1] through [10] are valid for both stationary and 
nonstationary inputs, while Equations [11] through [16] are valid 
only for stationary inputs. Similarly, Equations [17] through 
[23] are valid for both stationary and nonstationary inputs, while 
Equations [24] through [40] are valid only for stationary inputs. 

The error introduced by applying an equation valid only for 
stationary inputs to an input which is nonstationary depends on 
the length of the records available for correlation in comparison 
to the length of the significantly nonzero part of the weighting 
functions. For example, if the available correlations are based on 
(N + 1) ordinate values, spaced at a time interval T, while the
nonzero parts of the weighting functions are of length kT, then
the greatest error introduced in Equations [11] through [16] is 
the error of replacing 


+7 +kT) by +r + 2kT) 


Thus the error involves at most only k terms of the N + 1 terms 
which are averaged to form the correlation functions; if k = 20 
and N = 1000, the error may not be significant unless the inputs 
are drastically nonstationary, while if k = 20 and N = 100, the 
error may be appreciable. This illustrates the advantage of using 
a large number of ordinates in computing correlations, although 
this advantage always must be weighed against the greater cost 
in computing time. 
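
The two numerical cases quoted above can be compared by the fraction of averaged terms that the error can touch. The helper below is a hypothetical illustration of that arithmetic only; it does not estimate the actual error, which depends on how nonstationary the inputs are.

```python
def affected_fraction(k, n):
    """Largest fraction of the (n + 1) averaged terms that the error can involve."""
    return k / (n + 1)


long_record = affected_fraction(20, 1000)   # about 2 per cent of the terms
short_record = affected_fraction(20, 100)   # about 20 per cent of the terms
```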

For a two-input system with nonstationary inputs, Equations 
[11] through [16] may be used to give a first approximation to the
solution; then Equations [7] and [8], which are valid for non- 
stationary inputs, may be used to refine the solution. A similar 
procedure may be used for a three-input system, since Equations 
[18], [19], and [20] are valid for nonstationary inputs. 

To find the best linear representation of a nonlinear system with 
nonstationary inputs, it would be possible to rewrite Equations 
[28] through [35] and [37] through [40] in a much more compli- 
cated form that would make them applicable to nonstationary in- 
puts. However, since the linear representation is at best only an 
approximation, such a procedure might not ever be worth while 
in practice. 




Discussion 


T. M. Stout.⁴ A thorough job has been done in laying a
theoretical basis for characterization of multi-input systems from 
normal operating records. If experience with the proposed tech- 
niques shows them to be accurate and convenient, control- 
system engineers may find them a useful addition to their tool 
kit. 

The significance of this work in the nonlinear domain is less 
clear. It is well to remember that any quasi linearization of a 
nonlinear system has only restricted utility. The describing- 
function method (references 8 and 9 of the paper) provides a 
characterization for sinusoidal inputs. Ordinarily, but not 
necessarily, the describing function characterizes a system com- 
ponent which is only amplitude-dependent and which may be 
highly nonlinear; e.g., a relay. The describing-function method 
is concerned principally with absolute system stability but sheds 
some light on the relative stability or degree of damping. 

The equivalent gain (reference 10 of the paper) provides a 
characterization for random signals and, again, is ordinarily de- 
termined for a system component which is amplitude but not 
frequency-dependent. The method provides a means, for ex- 
ample, for approximate determination of the mean-square error 
when the system is subjected to a random input. 

The characterization discussed in this paper is a function of
time which is also amplitude-dependent. (A recent paper⁵
contains experimental data for a nonlinear servomechanism which
show a considerable variation in the nature of the weighting
function h(t) with a change in the amplitude level of the random input
signal.) If artificial manipulation of the input signal is
prohibited, the amplitude dependence of h(t) can be discovered only
by waiting for a change in the input amplitude. In a chemical
process, this delay could be a matter of days and characteristics of
the plant itself might change before there was any change in the
nature of the disturbances.
In addition, the region of application of the weighting function
h(t) is somewhat vague. It is not clear how h(t) may be used for
analysis of the system for which it was determined or of some
larger system containing the component characterized by h(t).
The weighting function, of course, could be used to compute an
output for the input from which it was originally determined; the
result is an approximation to data already on hand and therefore
not very valuable. Being a linear characterization determined for
a single input amplitude level, it will give an approximation of
indeterminate accuracy for inputs differing in amplitude level but
otherwise similar. Calculation of step or sinusoidal response
from h(t) is possible in principle, but results of any such
calculations likewise would be of doubtful validity. Perhaps the
situation may be summarized by saying h(t) is useful only if the system
is, in fact, so nearly linear that the term “nonlinear” is
inappropriate.

4 The Ramo-Wooldridge Corporation, Los Angeles, Calif.

5 “The Use of Correlation Techniques in the Study of
Servomechanisms,” by T. M. Burford, V. C. Rideout, and D. S. Sather,
Journal of the British Institution of Radio Engineers, vol. 15, May,
1955, pp. 249-257.

Because of the potential importance of the techniques discussed 
in this paper, attention should be given to the problems involved 
in both determination and application of h(t) which arise from
the amplitude dependence of nonlinear phenomena. Possibly a 
considerable amount of additional work, leading to other papers 
on the subject, will be required to treat these questions ade- 
quately. 


AUTHOR’S CLOSURE


Mr. Stout has performed a useful service in describing more 
fully some of the limitations mentioned in the paper in connec- 
tion with the application of the correlation-and-deconvolution 
technique to nonlinear systems. The author agrees that much 
additional work will be required to define these limitations 
adequately and to establish the range of useful application of this 
technique. This additional work will depend on the availability 
of data from industrial processes and on the co-operation of 
organizations that have access to such data. 

For the present, it appears that the principal application to 
nonlinear systems of the correlation-and-deconvolution
technique will be to obtain a linear model (the DLS set with the
coefficients $h_n$) that can be used in analog studies of alternative
schemes for controlling a system or of combinations of a system
with other elements in a more complex control system. For a
“mildly” nonlinear system, this model, representing as it does
the major dynamic effects in the system, should prove sufficiently
accurate for many useful control studies. For a “strongly”
nonlinear system, it is conceivable that the method suggested by
Tustin (reference 12 of the paper) can be used to find a correc- 
tion to the linear model that can then be applied to devise a non- 
linear model representing, to some extent, the amplitude depend- 
ence of the system. For a system whose normal operation can- 
not be disturbed, such a model would provide the best availa- 
ble information on the dynamic characteristics of the system. 


Hunting Due to Lost Motion 


By H. PORITSKY,¹ SCHENECTADY, N. Y.


Hunting of a servosystem, due to lost motion, say, in one 
of its mechanical links, but in absence of input signals, is 
considered. If the slack is assumed to be taken up sud- 
denly, the motion is governed by a linear differential 
equation but with proper discontinuities when the direc- 
tion of motion in the loose link is reversed. For the case of 
second-order systems it is shown that, if the characteristic 
roots are complex, a periodic hunting motion always exists, 
and that the system, no matter how it is started, will con- 
verge to this hunting motion. If the characteristic roots 
of the second-order system are real, then a periodic hunt- 
ing motion exists, but depending upon how the system is 
started, it may converge to this motion or it may converge 
to a stable position at either end of the lost-motion band. 
Third and higher-order systems are studied in a similar 
way and the equations for determination of periodic hunt- 
ing motion obtained. Second-order systems, in which the 
system “coasts” as the slack is taken up, are discussed
briefly.


1 INTRODUCTION 


The following is concerned with hunting of a servosystem, due 
to lost motion, say, in one of its mechanical links. 

The particular system studied is shown schematically in Fig. 1. 
It is designed to make the output or load (motor angle) $\theta$ follow
the input I. The error

$$\delta = I - \theta \tag{1}$$

is amplified by means of a hydraulic control to x, in the manner
described by Equation [3], to produce an acceleration of the
output or load $p^2\theta$ which is proportional to x, generated by the
control.

It is assumed that lost motion of amount $\Delta x = 2h$ occurs in one
of the mechanical links of the valve in the hydraulic control, as a
result of which the motor torque, as well as $p^2\theta$, is proportional not
to x but to x − h or x + h according as x is increasing or decreasing.

We shall be concerned with the case of no input, I = 0, thus
leading to Equation [4] for the error $\delta$. Without lost motion, the
system investigated is described by the third-order system of
differential Equations [2] to [4]. With lost motion as indicated in
Fig. 1, Equation [2] is replaced by Equation [5].

Since motion without input is investigated, the hunting
motions, if any, are self-excited. However, some initial displacement
away from the quiescent position is postulated, such as may exist
after the signal I has been reduced to zero and the system
presumably should settle down to rest. We do not consider the
inaccuracies in follow-up for nonvanishing I due to lost motion.

In equation form the third-order system of Fig. 1, in absence of 
lost motion, is described by (p = d/dt) 


1 Consulting Engineer, General Electric Company. Mem. ASME.

Contributed by the Instruments and Regulators Division of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS and presented at the
ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
19, 1956. Paper No. 56—IRD-12.


$$p^2\theta = Ax \tag{2}$$

$$(p + B)x = (Cp + D)\delta \tag{3}$$

$$\theta + \delta = 0 \tag{4}$$

where A, B, C, D are positive constants. As just stated, the lost
motion will be assumed to occur between x and $\theta$; to include it
one modifies Equation [2] in the manner indicated in Figs. 2, 3,
and given by

$$p^2\theta = \begin{cases} A(x - h) & \text{while } x \text{ is increasing} \\ A(x + h) & \text{while } x \text{ is decreasing} \end{cases} \tag{5}$$


[Fig. 3: $p^2\theta$ versus x]


Fig. 2 shows x versus t, while Fig. 3 shows a plot of $p^2\theta$ versus x.
While x is increasing with time along the curve ab in Fig. 2, the
relation between x and $p^2\theta$ is shown by the straight line ab in Fig.
3. When x starts decreasing with time, the representative point
on Fig. 3 shifts suddenly from b to b’, and as x decreases along bc
in Fig. 2, the representative point in Fig. 3 describes the straight
line b’c. Then, when in Fig. 2 x increases along cd, the line c’d is
followed in Fig. 3, and so forth. It is evident that in Fig. 3 the
“hysteresis loops” are not of a fixed size, but have their horizontal
sides at heights corresponding to the maxima and minima values
of x.
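
The switching rule of Equation [5] can be written as a small stateful routine. This is a minimal sketch on sampled values of x, assuming the slack is taken up instantly at each reversal as in the text; the initial direction, which the text does not specify, is assumed here to be increasing, and the function name and sample values are hypothetical.

```python
def lost_motion_drive(x_samples, gain, half_gap, initially_rising=True):
    """Return A*(x - h) while x is increasing and A*(x + h) while x is decreasing."""
    outputs = []
    rising = initially_rising      # assumed starting direction
    previous = x_samples[0]
    for x in x_samples:
        if x > previous:
            rising = True
        elif x < previous:
            rising = False         # reversal: the drive jumps by 2*A*h at the same x
        outputs.append(gain * (x - half_gap if rising else x + half_gap))
        previous = x
    return outputs


# x rises to a maximum, falls to a minimum, then rises again (as along a-b-c-d).
x_path = [0.0, 1.0, 2.0, 1.0, 0.0, 1.0]
drive = lost_motion_drive(x_path, gain=1.0, half_gap=0.5)
```

The loop traced by `drive` against `x_path` has its jumps at the turning values of x, so its size depends on the motion rather than being fixed.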


[Fig. 1: schematic of the servosystem]

[Fig. 2: x versus t]


To start with, a simpler second-order system will first be
considered, which differs from the preceding one in having $p^2\theta$
replaced by $p\theta$. This system is given by

$$(p + B)x + (Cp + D)\theta = 0 \tag{6}$$

$$p\theta = \begin{cases} A(x - h) & \text{while } x \text{ is increasing} \\ A(x + h) & \text{while } x \text{ is decreasing} \end{cases} \tag{7}$$

A corresponding change in replacing $A/p^2$ by $A/p$ and in labeling
the horizontal axis applies to Figs. 1 and 3.

It is shown that for the system given by Equations [6], [7], if
the characteristic roots $\lambda_1$, $\lambda_2$ of the system are complex, a hunting
or self-excited oscillation exists, and the system settles down to it,
no matter how it is started. If the roots $\lambda_1$, $\lambda_2$ are real (and
negative), i.e., if the system is overdamped, a hunting motion exists,
but whether the system settles into it or settles to rest in the dead
band depends upon how the system is started.
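
Which of the two cases applies can be read from the constants. Eliminating $\theta$ from Equations [6] and [7] gives the characteristic polynomial $p(p + B) + A(Cp + D)$, that is, $p^2 + (B + AC)p + AD$ (see Section 2). The sketch below computes its roots; the function names and the sample constants are hypothetical, and grouping a repeated real root with the real case is an assumption made here, not a statement from the paper.

```python
import cmath


def characteristic_roots(A, B, C, D):
    """Roots of p**2 + (B + A*C)*p + A*D = 0."""
    b = B + A * C
    c = A * D
    root = cmath.sqrt(b * b - 4.0 * c)
    return (-b + root) / 2.0, (-b - root) / 2.0


def roots_are_complex(A, B, C, D):
    """True when the discriminant is negative (underdamped case)."""
    b = B + A * C
    return b * b < 4.0 * A * D


# Hypothetical positive constants for the two cases.
underdamped = roots_are_complex(1.0, 1.0, 0.0, 1.0)    # discriminant 1 - 4 < 0
overdamped = roots_are_complex(1.0, 3.0, 0.0, 1.0)     # discriminant 9 - 4 > 0
lam1, lam2 = characteristic_roots(1.0, 3.0, 0.0, 1.0)  # both real and negative
```

With positive A, B, C, D the real roots are always negative, since their sum is −(B + AC) and their product is AD.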

The present study dates from 1940-1941. The author wishes to 
thank the referees for calling his attention to the articles that have 
appeared in the literature in the meantime, some of which are 
listed as references at the end of the paper. 

Many published treatments assume that the periodic hunting 
motion can be approximated by means of sinusoidal motion. Or 
else the first harmonic of the periodic hunting motion is obtained, 
and existing theory of linear systems is modified to include a 
transfer function for the lost motion; this function, however, will 
depend on the amplitude of the sinusoidal component. By contrast,
the following does not resort to any such approximations,
but utilizes exact solutions of the system equations, changing
from one "orbit" to another as $x$ passes through a maximum
or minimum. Of the several references one by Tustin (1)² comes
closest to utilizing the same method as the one employed in the
following.

While the treatment covers primarily the examples represented 
by the foregoing equations, the method is given in a form which 
can be applied equally well to other linear servosystems with lost 
motion, including systems of higher order. 


2 SECOND-ORDER SYSTEM: GENERAL SOLUTION


We now consider the second-order system of Equations [6] and
[7]. Suppose first that $x$ is increasing. Applying $(Cp + D)$ to
both sides of the proper Equation [7] there results

$$(Cp + D)p\theta = A(Cp + D)(x - h) \tag*{[8]}$$

Interchanging the order of the operators $(Cp + D)$, $p$ on the left-hand
side and eliminating $\theta$ by means of Equation [6] there results

$$p(p + B)x + A(Cp + D)(x - h) = 0 \tag*{[9]}$$

This also may be written in the form

$$\{\Delta(p) \equiv [\,p(p + B) + A(Cp + D)\,]\}(x - h) = 0 \tag*{[10]}$$

since the derivatives of the constant $-h$ vanish. Thus while $x$ is
increasing, $x - h$ satisfies the second-order differential equation
[10]. Similarly, it can be shown that $\theta + (hB/D)$ satisfies the
same equation $\Delta(p) = 0$, while $x$ is increasing. On the other
hand, while $x$ is decreasing, it can be shown that $x + h$ and
$\theta - hB/D$ satisfy the same differential equation $\Delta(p) = 0$. At the
transition point $x$ and $\theta$ are continuous; however, the derivatives
of $x$ and $\theta$ are discontinuous there.

² Numbers in parentheses refer to the Bibliography at the end of
the paper.

It will be convenient to introduce the variables

$$\xi = x \mp h, \qquad \eta = \theta \pm hB/D \tag*{[11]}$$

where the upper or the lower definitions apply according to
whether $x$ is increasing or decreasing. Then it follows from the
foregoing that, while $x$ is either increasing or decreasing, $\xi$, $\eta$
satisfy the homogeneous differential equation


$$\Delta(p) \equiv p^2 + p(B + AC) + AD = 0 \tag*{[12]}$$

as well as the homogeneous system of equations

$$p\eta = A\xi, \qquad (p + B)\xi + (Cp + D)\eta = 0 \tag*{[13]}$$

However, $\xi$, $\eta$ undergo the sudden changes

$$\Delta\xi = \pm 2h, \qquad \Delta\eta = \mp 2hB/D \tag*{[14]}$$

when $x$ passes through extremal values, the upper signs applying
when $x$ passes through a maximum, the lower through a minimum.

In the following we shall consider the solution in terms of $\xi$ and
$\eta$, and plot it in the $(\xi, \eta)$-plane. We refer to the solutions of
Equations [13] as "orbits." The $(\xi, \eta)$ point will follow an orbit,
say, while $\xi$ is increasing; then it will jump to a new orbit in
accordance with the upper signs in Equations [14]. The latter will
be followed while $\xi$ is decreasing, when the opposite $\Delta\xi$, $\Delta\eta$ jumps
occur, etc.

It is readily shown that the solutions of Equations [13] may be
expressed as linear combinations of two exponentials

$$\xi = A_1 e^{\lambda_1 t} + A_2 e^{\lambda_2 t}, \qquad \eta = A\left[\frac{A_1}{\lambda_1} e^{\lambda_1 t} + \frac{A_2}{\lambda_2} e^{\lambda_2 t}\right] \tag*{[15]}$$

where $\lambda_1$, $\lambda_2$ are the roots (assumed distinct) of the algebraic equation

$$\Delta(\lambda) \equiv \lambda^2 + \lambda(B + AC) + AD = 0 \tag*{[16]}$$

and $A_1$, $A_2$ are arbitrary constants. Since the coefficients $A$,
$B$, $C$, $D$ are real, the roots of Equation [16] are either real or
conjugate imaginaries. If the servosystem is not unstable, the real
parts of the roots must be negative. This will be assumed to be
the case.
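The distinction between real and conjugate roots of Equation [16] governs everything that follows, so a small numerical check may be useful. This is an illustrative sketch only; the function names `characteristic_roots` and `classify_roots` are hypothetical and the coefficient values in the usage lines are arbitrary.

```python
import cmath


def characteristic_roots(A, B, C, D):
    """Roots of lambda^2 + (B + A*C)*lambda + A*D = 0, Equation [16]."""
    b = B + A * C
    c = A * D
    root_disc = cmath.sqrt(b * b - 4.0 * c)
    return (-b + root_disc) / 2.0, (-b - root_disc) / 2.0


def classify_roots(A, B, C, D):
    """Return 'real', 'complex' or 'repeated' from the discriminant."""
    disc = (B + A * C) ** 2 - 4.0 * A * D
    if disc > 0.0:
        return "real"
    if disc < 0.0:
        return "complex"
    return "repeated"


if __name__ == "__main__":
    print(classify_roots(1.0, 2.5, 0.5, 1.0), characteristic_roots(1.0, 2.5, 0.5, 1.0))
    print(classify_roots(1.0, 0.4, 0.2, 1.0), characteristic_roots(1.0, 0.4, 0.2, 1.0))
```

For positive $A$, $B$, $C$, $D$ the sum of the roots is $-(B + AC)$ and their product is $AD$, so the real parts are negative in both cases, in line with the stability assumption made in the text.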
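If the constant $A_1$ in Equations [15] vanishes, then Equations
[15] reduce to

$$\xi = A_2 e^{\lambda_2 t}, \qquad \eta = \frac{A}{\lambda_2} A_2 e^{\lambda_2 t} \tag*{[17]}$$

Hence in the $(\xi, \eta)$-plane the representative point moves along the
straight line

$$\eta = \frac{A}{\lambda_2}\,\xi \tag*{[18]}$$

Similarly, if $A_2$ vanishes in Equations [15], the point moves along
the straight line

$$\eta = \frac{A}{\lambda_1}\,\xi \tag*{[19]}$$

If oblique Cartesian axes $\xi_1$, $\eta_1$ be introduced with these lines as
their co-ordinate axes, by letting

$$\xi_1 = \xi - \frac{\lambda_2}{A}\,\eta, \qquad \eta_1 = \xi - \frac{\lambda_1}{A}\,\eta \tag*{[20]}$$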

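then it is easy to show that the differential system given by
Equations [13] transforms into

$$\frac{d\xi_1}{dt} = \lambda_1 \xi_1, \qquad \frac{d\eta_1}{dt} = \lambda_2 \eta_1 \tag*{[21]}$$

Its solution is given by

$$\xi_1 = B_1 e^{\lambda_1 t}, \qquad \eta_1 = B_2 e^{\lambda_2 t} \tag*{[22]}$$

where $B_1$, $B_2$ are arbitrary constants, related to $A_1$, $A_2$ as follows

$$B_1 = A_1\left(1 - \frac{\lambda_2}{\lambda_1}\right), \qquad B_2 = A_2\left(1 - \frac{\lambda_1}{\lambda_2}\right) \tag*{[23]}$$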

The variables $\xi_1$, $\eta_1$ which cause the system to reduce to the
simple form given by Equations [21] in which the variables separate,
constitute the "normal co-ordinates" of the system.
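The separation of variables in normal co-ordinates can be checked numerically. The sketch below assumes the reading $\xi_1 = \xi - (\lambda_2/A)\eta$, $\eta_1 = \xi - (\lambda_1/A)\eta$ of Equations [20] and the two-exponential solution of Equations [15]; the function names and the numerical values are illustrative assumptions, not part of the paper.

```python
import math


def orbit(t, A1, A2, A, lam1, lam2):
    """(xi, eta) at time t from the two-exponential solution, Equations [15]."""
    e1, e2 = math.exp(lam1 * t), math.exp(lam2 * t)
    xi = A1 * e1 + A2 * e2
    eta = A * (A1 * e1 / lam1 + A2 * e2 / lam2)
    return xi, eta


def normal_coordinates(xi, eta, A, lam1, lam2):
    """Assumed reading of Equations [20]: each co-ordinate removes one exponential."""
    return xi - lam2 * eta / A, xi - lam1 * eta / A


if __name__ == "__main__":
    A, lam1, lam2 = 1.0, -2.0, -1.0   # e.g. B + A*C = 3, A*D = 2
    A1, A2, t = 1.0, 2.0, 0.7
    xi1, eta1 = normal_coordinates(*orbit(t, A1, A2, A, lam1, lam2), A, lam1, lam2)
    # Compare with B1*exp(lam1*t) and B2*exp(lam2*t), Equations [22] and [23]
    print(xi1, A1 * (1 - lam2 / lam1) * math.exp(lam1 * t))
    print(eta1, A2 * (1 - lam1 / lam2) * math.exp(lam2 * t))
```

Each printed pair should agree to rounding error, showing that $\xi_1$ carries only the $e^{\lambda_1 t}$ term and $\eta_1$ only the $e^{\lambda_2 t}$ term.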


3 REAL ROOTS


If $\lambda_1$, $\lambda_2$ are real, then they are both negative, say as shown in
Fig. 4; we assumed $\lambda_1 \neq \lambda_2$. The orbits in the $(\xi_1, \eta_1)$-plane
described by Equations [22] are then as shown in Fig. 5. Except
for the two axes, they are fractional power parabolas, given by

$$\eta_1\,\xi_1^{-\lambda_2/\lambda_1} = \text{const} \tag*{[24]}$$

which equation is obtained from Equations [22] by eliminating $t$.
All the curves given by Equations [22] are tangent to the $\eta_1$-axis,
with exception of the two orbits constituting the positive and
negative $\xi_1$-axis.

Fig. 4 ($\lambda$-plane)

Fig. 5

The relation between $\xi$, $\eta$ and $\xi_1$, $\eta_1$ given by Equations [20] is
called an "affine linear transformation." Such a transformation
between two planes changes parallel lines into parallel lines,
but in general conserves neither distances nor angles. For
simplicity we may refer to it as a "skewing" transformation.
Solving Equations [20] for $\xi$, $\eta$ one obtains for the inverse a similar
affine or skewing transformation. Hence the orbits in the $(\xi, \eta)$-plane
are as shown in Fig. 6, and form a skew image of the orbits
of Fig. 5.

The two lines given by Equations [18] and [19] correspond,
respectively, to the $\eta_1$-, $\xi_1$-axes of Fig. 5, in view of Equations [20]. In
Fig. 6, since $\lambda_1$, $\lambda_2$ are negative, these lines lie in the second and
fourth quadrants. Except for the $\xi_1$-axis, the orbits are tangent to
the $\eta_1$-axis.

Fig. 6

By eliminating $dt$ from Equations [13] one obtains for the
reciprocal slope of the orbits

$$\frac{d\xi}{d\eta} = -\frac{1}{A}\left[B + AC + D\,\frac{\eta}{\xi}\right] \tag*{[25]}$$

This slope is a function of $\eta/\xi$ only, and hence is the same at all
points of any straight line through the origin. Hence, or directly
from Equations [15], it follows that the orbits are similar about the
origin.

It will now be recalled that in the $(\xi, \eta)$-plane an orbit is followed
until the sign of $d\xi/dt$ starts changing. From Equation
[25] it will be seen that this occurs along the straight line

$$\eta = -\frac{AC + B}{D}\,\xi \tag*{[26]}$$

This is shown as a dashed line in Figs. 5 and 6. When the orbit
hits the lower half of this line, a sudden shift such as $PP_1$,
corresponding to the upper signs of Equations [14], takes place; when
it hits the upper half of this line, a similar shift $PP_2$, corresponding
to the lower signs of Equations [14], occurs.

Following through the various orbits and jumps, we arrive at
the possibilities shown on Fig. 7. If after a jump the point lands
in the regions with slanted shading, no further change of sign of
$d\xi/dt$ occurs, and the point moves in toward the origin along a
proper orbit. This happens along the orbits in the vertically
shaded regions of Fig. 7. The orbits in these regions are the orbits
which after one translation become stable. Between the shaded
areas are regions where hunting takes place. The hunting motions
approach a permanent oscillation of fixed amplitude.

Fig. 7 (the dashed line is $\eta = -(AC + B)\xi/D$, Equation [26])

Fig. 8

To prove this consider Fig. 8 which is a magnified image of the
part of Fig. 7 near the origin. It will be noted that the orbits from
the unshaded region below the negative $\xi_1$-axis hit the dashed line
between $O$ and $P$. They are then translated to $O_1P_1$, and, following
the curves of the new region, hit the dashed line again at $OP_2$,
whence they are translated into $P_3O_3$. Thus they have been compressed
to a small region of the original unshaded area, and, in fact,
they hit the dashed line again between $O$ and $P$ over a much smaller
interval than $OP$. This compression continues as the motion proceeds,
with $x$ increasing and decreasing alternately, the width of
the corresponding interval decreasing eventually in geometric
ratio. The various motions, therefore, approach a certain limiting
curve which is the permanent oscillation or hunting motion in
question.




The amplitude of the permanent oscillation is proportional to 
h, and likewise the width of the region from which the permanent 
oscillation develops is proportional to h. 

While so far the motion has been described in the $(\xi, \eta)$-plane of
Figs. 7, 8, it is of interest also to describe it in the original $(x, \theta)$-plane.
Recalling Equations [11] it will be found that the upper
signs apply to the left of the dashed line $P_2O \ldots P$ of Fig. 8,
where $x$ is increasing along the orbits. This half plane then
suffers a translation relative to the $(\xi, \eta)$-plane of Fig. 8 so that
the origin $O$ is moved to $O_1$. Similarly, the half plane to the right
of this dashed line is translated so that $O$ is moved to $O_3$. The motion
in the $(x, \theta)$-plane can therefore be described in terms of
skewed images of fractional-power parabolas with vertexes at two
different points, $O_1$, $O_3$. There is the advantage that the jumps
are eliminated. On the other hand, the orbits converging to $O_1$
and $O_3$ overlap each other. The "stable" motions converge to $O_1$
or $O_3$.

It is evident that the periodic hunting motion is by no means an
ellipse, which it would be if $x$ and $\theta$ were sinusoidal in time.


4 COMPLEX ROOTS

If $\lambda_1$, $\lambda_2$ are complex, then as shown in Fig. 9, they are given by

$$\lambda_1 = \mu + i\nu, \qquad \lambda_2 = \mu - i\nu \tag*{[27]}$$

where $\mu$, $\nu$ are real and $\mu$ is negative. Equations [15 to 23] and
the transformation to $\xi_1$, $\eta_1$ are still valid, but for real $\xi$, $\eta$ lead to
complex $\xi_1$, $\eta_1$ which are conjugate complex of each other.
Similarly, the straight lines given by Equations [18, 19] turn out
imaginary.

Fig. 9 ($\lambda$-plane)

This difficulty is remedied by introducing a further transformation

$$\xi_2 = (\xi_1 + \eta_1)/2 = \xi - \frac{\lambda_1 + \lambda_2}{2A}\,\eta = \xi - \frac{\mu}{A}\,\eta, \qquad \eta_2 = (\xi_1 - \eta_1)/2i = \frac{\lambda_1 - \lambda_2}{2iA}\,\eta = \frac{\nu}{A}\,\eta \tag*{[28]}$$

Since in Equations [15] $A_1$ and $A_2$ are conjugate imaginaries, so are
also $B_1$, $B_2$ in Equations [22], and $\xi_1$ and $\eta_1$; hence $\xi_2$, $\eta_2$ are real
and are given by

$$\xi_2 = R(\xi_1) = R\left[A_1\left(1 - \frac{\lambda_2}{\lambda_1}\right)e^{\lambda_1 t}\right], \qquad \eta_2 = I(\xi_1) = I\left[A_1\left(1 - \frac{\lambda_2}{\lambda_1}\right)e^{\lambda_1 t}\right] \tag*{[29]}$$

where $R(z)$, $I(z)$ denote, respectively, the real and imaginary parts
of $z$.

From Equations [29] it follows that the points $(\xi_2, \eta_2)$ describe
a logarithmic spiral in the $(\xi_2, \eta_2)$-plane as shown in Fig. 10.
Equations [28] show that between the $(\xi, \eta)$-plane and the
$(\xi_2, \eta_2)$-plane there exists a relation of the same kind as between the
planes of Fig. 5 and Fig. 6, namely, a skewing or linear homogeneous
transformation, sending parallel lines into parallel lines, but
deforming circles into ellipses. Thus in the original $(\xi, \eta)$-plane
the orbits are skewed images of logarithmic spirals.

Equations [25] and [26] still apply here. The points at which
$\xi$ attains its maxima or minima are once more given by the straight
line Equation [26] in the $(\xi, \eta)$-plane; in view of the linear
relation between the variables $\xi$, $\eta$ and $\xi_2$, $\eta_2$, this straight line
transforms into a proper straight line through the origin in the
$(\xi_2, \eta_2)$-plane shown as a dashed line $aOb$ in Fig. 10. It is now
clear that all orbits, no matter where they start, necessarily intersect
$aOb$ eventually. Thus if $\xi$ is increasing, it will necessarily
attain a maximum and start decreasing again. The maximum is
taken on when the orbit meets the right half $Ob$ of the dashed line;
then the values of $\xi$ and $\eta$ are shifted by a constant vector which
corresponds to upper signs in Equations [14]. Starting along new
logarithmic spirals from the displaced points, along $O'b'$, one continues
along an orbit until it cuts the dashed straight line along
$aO$, whereupon a translation in the opposite direction, to $a'O''$,
takes place, and so forth.

Fig. 10

It will be noted that the logarithmic spiral which starts far 
away from the origin converges toward it appreciably during one 
of these "half cycles." On the other hand, a point starting very
close to the origin gets farther away from it, largely on account 
of the fixed translation. Thus there is one point on the dashed line 
Ob which after the translation and convergence along the spiral, 
finds itself at the same distance from the origin along ab. This 
point describes a periodic motion. The curves that start on the 
outside of this closed path get smaller and converge toward it, 
while the curves that start on its inside expand and approach it. 

Thus for the case of complex characteristic roots there always 
exists a self-oscillation or hunting due to the lost motion, and no 
matter where the system is started, it necessarily approaches this 
self-oscillation as a limiting motion. 
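The convergence toward a single hunting motion can be explored numerically. The sketch below is not the author's exact-solution method: it integrates Equations [13] with a fixed-step Runge-Kutta scheme, applies the jumps of Equations [14] whenever $d\xi/dt$ changes sign, and records successive maxima of $x = \xi + h$. The function name `simulate_hunting`, the step size, and the coefficient values (chosen to give complex roots) are assumptions made for illustration.

```python
def simulate_hunting(xi0, eta0, A=1.0, B=0.4, C=0.2, D=1.0, h=1.0,
                     dt=0.002, t_end=200.0):
    """Integrate Equations [13] with the jumps of Equations [14].

    Returns the successive maxima of x = xi + h (valid while x increases).
    """
    def rhs(xi, eta):
        # d(xi)/dt and d(eta)/dt obtained by solving Equations [13]
        return -(B + A * C) * xi - D * eta, A * xi

    def rk4_step(xi, eta):
        k1 = rhs(xi, eta)
        k2 = rhs(xi + 0.5 * dt * k1[0], eta + 0.5 * dt * k1[1])
        k3 = rhs(xi + 0.5 * dt * k2[0], eta + 0.5 * dt * k2[1])
        k4 = rhs(xi + dt * k3[0], eta + dt * k3[1])
        return (xi + dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0,
                eta + dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0)

    xi, eta = xi0, eta0
    direction = 1.0 if rhs(xi, eta)[0] > 0.0 else -1.0  # +1 while x increases
    maxima = []
    for _ in range(int(t_end / dt)):
        v_old = rhs(xi, eta)[0]
        xi_new, eta_new = rk4_step(xi, eta)
        v_new = rhs(xi_new, eta_new)[0]
        if direction * v_new <= 0.0:
            # Interpolate linearly to the point where d(xi)/dt = 0
            s = v_old / (v_old - v_new)
            xi += s * (xi_new - xi)
            eta += s * (eta_new - eta)
            if direction > 0.0:
                maxima.append(xi + h)
            # Upper signs of Equations [14] at a maximum, lower at a minimum
            xi += direction * 2.0 * h
            eta -= direction * 2.0 * h * B / D
            direction = -direction
        else:
            xi, eta = xi_new, eta_new
    return maxima


if __name__ == "__main__":
    print(simulate_hunting(5.0, 0.0)[-1], simulate_hunting(0.05, 0.0)[-1])
```

Starting well outside and well inside the closed path, the last recorded maxima of the two runs should be close to each other, which is the behavior the text describes for complex roots.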

As in Section 3 the motion may be described in the original
$(x, \theta)$-plane. Here the orbits are continuous and form segments of
skewed images of equiangular spirals converging to two different
centers. For small $\mu/\nu$ they more nearly resemble ellipses.

While the foregoing treatment is largely descriptive and geometric,
a purely analytic treatment can also be given. This is done
in Sections 5, 6 for third-order and $n$th-order systems; the latter
includes the case $n = 2$.


5 THIRD-ORDER SYSTEM: REAL ROOTS


We consider next the third-order system given by Equations
[3 to 5], and possessing lost motion as represented by Equation
[5] and Fig. 3. As $x$ passes through a maximum or minimum, $x$,
$\theta$, $px$, $p\theta$ are continuous, but $p^2\theta$ is discontinuous in accordance
with Equation [5].

We introduce the variables

$$\xi = x \mp h, \qquad \eta = \theta \pm hB/D, \qquad \zeta = p\theta \tag*{[30]}$$

where the upper or lower signs apply accordingly as $x$ is increasing
or decreasing. It is readily shown that between maxima and
minima of $x$, the variables $\xi$, $\eta$, $\zeta$ satisfy the homogeneous system

$$(p + B)\xi + (Cp + D)\eta = 0, \qquad p\eta = \zeta, \qquad p\zeta = A\xi \tag*{[31]}$$

while each one is a solution of

$$\Delta(p) \equiv p^3 + Bp^2 + ACp + AD = 0 \tag*{[32]}$$

When $x$ passes through an extremal value, the variables $\xi$, $\eta$, $\zeta$
change as follows

$$\Delta\xi = \pm 2h, \qquad \Delta\eta = \mp 2hB/D, \qquad \Delta\zeta = 0 \tag*{[33]}$$

with the upper signs corresponding to passage of $x$ through a
maximum.
In general the characteristic equation

$$\Delta(\lambda) \equiv \lambda^3 + B\lambda^2 + AC\lambda + AD = 0 \tag*{[34]}$$

of Equations [31] or [32] will have three unequal roots, $\lambda_1$, $\lambda_2$, $\lambda_3$,
and the solutions for $\xi$, $\eta$, $\zeta$ will reduce to linear combinations of
the exponentials $e^{\lambda_1 t}$, $e^{\lambda_2 t}$, $e^{\lambda_3 t}$. The roots are either all real
and negative as in Fig. 11, or else one, say $\lambda_3$, is real and negative,
and two are conjugate complex, with a negative real part, as in
Figs. 12(a, b).

Fig. 12 (a), (b)
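Which of the two root patterns applies can be decided from the coefficients of Equation [34] without solving the cubic. The sketch below uses the standard discriminant of a monic cubic and the Routh-Hurwitz conditions for a cubic; neither test appears in the paper, and the function names are hypothetical.

```python
def cubic_discriminant(b, c, d):
    """Discriminant of lambda^3 + b*lambda^2 + c*lambda + d."""
    return 18 * b * c * d - 4 * b ** 3 * d + b * b * c * c - 4 * c ** 3 - 27 * d * d


def third_order_case(A, B, C, D):
    """Classify the roots of Equation [34] and test stability.

    Stability uses the Routh-Hurwitz conditions for a cubic:
    b > 0, d > 0 and b*c > d, with b = B, c = A*C, d = A*D.
    """
    b, c, d = B, A * C, A * D
    disc = cubic_discriminant(b, c, d)
    stable = b > 0 and d > 0 and b * c > d
    if disc > 0:
        kind = "three distinct real roots"
    elif disc < 0:
        kind = "one real root and a complex pair"
    else:
        kind = "repeated root"
    return kind, stable


if __name__ == "__main__":
    print(third_order_case(1, 6, 11, 6))   # (lambda+1)(lambda+2)(lambda+3)
    print(third_order_case(1, 3, 4, 2))    # (lambda+1)(lambda^2+2*lambda+2)
```

With $A$, $B$, $C$, $D$ positive the stability condition reduces to $BC > D$, which is the case assumed throughout Sections 5 and 6.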


When the roots are real, it is possible to introduce normal
co-ordinates $\xi_1$, $\eta_1$, $\zeta_1$ as proper linear combinations of $\xi$, $\eta$, $\zeta$, for
which the system given by Equations [31] reduces to

$$\frac{d\xi_1}{dt} = \lambda_1\xi_1, \qquad \frac{d\eta_1}{dt} = \lambda_2\eta_1, \qquad \frac{d\zeta_1}{dt} = \lambda_3\zeta_1 \tag*{[35]}$$

and whose solutions are therefore given by

$$\xi_1 = A_1 e^{\lambda_1 t}, \qquad \eta_1 = A_2 e^{\lambda_2 t}, \qquad \zeta_1 = A_3 e^{\lambda_3 t} \tag*{[36]}$$

This is done by seeking solutions of Equations [31] of the form

$$\xi = B_1 e^{\lambda t}, \qquad \eta = B_2 e^{\lambda t}, \qquad \zeta = B_3 e^{\lambda t} \tag*{[37]}$$

where $B_i$, $\lambda$ are constants. Substitution of Equations [37] into
Equations [31] leads to three homogeneous linear equations in $B_i$,
which admit solutions other than $B_1 = B_2 = B_3 = 0$ if and only if
the determinant vanishes. In this way Equation [34] results.
Corresponding to each root of Equation [34] (assumed real and
distinct) the three equations in $B_i$ become linearly dependent and
of rank 2, and determine unique ratios $B_1:B_2:B_3$. Equations [37]
then yield a straight-line solution for each root, and these lines
form the axes of a set of oblique Cartesian co-ordinates which are
the normal variables $\xi_1$, $\eta_1$, $\zeta_1$. The positive direction and the
scale on each axis may be chosen so that Equation [38] in the following
is valid.

Fig. 11 ($\lambda$-plane)

It is readily shown that the solutions possess similarity about
the origin of the $(\xi, \eta, \zeta)$ or $(\xi_1, \eta_1, \zeta_1)$-space, such that if a conical
surface with the vertex at the origin be drawn through any one
orbit $C_0$, the curves on the same conical surface obtained by magnifying
$C_0$ by any factor also will be orbits. Therefore the orbits
may be followed to a certain extent by investigating the intersection
of these conical surfaces with a (unit) sphere $\Sigma$ with center at
the origin. Such a sphere is shown on Fig. 13 where the great
circles correspond to orbits lying in one of the co-ordinate planes.
The projection of a general orbit on each co-ordinate plane is a
fractional power parabola. All the orbits approach the origin;
this is not indicated in Fig. 13.

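The linear relations between $\xi$, $\eta$, $\zeta$ and $\xi_1$, $\eta_1$, $\zeta_1$ may be solved
for $\xi$, $\eta$, $\zeta$. By multiplying $\xi_1$, $\eta_1$, $\zeta_1$ by proper constants, if necessary,
$\xi$ may be expressed as follows

$$\xi = \frac{\xi_1}{\lambda_1} + \frac{\eta_1}{\lambda_2} + \frac{\zeta_1}{\lambda_3} \tag*{[38]}$$

From Equations [35] and [38] follows

$$\frac{dx}{dt} = \frac{d\xi}{dt} = \xi_1 + \eta_1 + \zeta_1 \tag*{[39]}$$

$$\frac{d^2x}{dt^2} = \frac{d^2\xi}{dt^2} = \lambda_1\xi_1 + \lambda_2\eta_1 + \lambda_3\zeta_1 \tag*{[40]}$$

Now at a maximum of $x$ Equations [39], [40] yield

$$\xi_1 + \eta_1 + \zeta_1 = 0, \qquad \lambda_1\xi_1 + \lambda_2\eta_1 + \lambda_3\zeta_1 < 0 \tag*{[41]}$$

while at a minimum

$$\xi_1 + \eta_1 + \zeta_1 = 0, \qquad \lambda_1\xi_1 + \lambda_2\eta_1 + \lambda_3\zeta_1 > 0 \tag*{[42]}$$

The loci $\Pi_1$, $\Pi_2$ defined by Equations [41] and [42] constitute two
halves of the plane

$$\Pi:\quad \xi_1 + \eta_1 + \zeta_1 = 0 \tag*{[43]}$$

divided by the straight line which forms the intersection of $\Pi$
with the plane

$$\lambda_1\xi_1 + \lambda_2\eta_1 + \lambda_3\zeta_1 = 0 \tag*{[44]}$$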



From the foregoing it follows that an orbit is followed till it hits
either $\Pi_1$ or $\Pi_2$ and then the changes given by Equations [33] take
place, or in terms of the normal variables

$$\Delta\xi_1 = \pm a, \qquad \Delta\eta_1 = \pm b, \qquad \Delta\zeta_1 = \pm c \tag*{[45]}$$

where $a$, $b$, $c$ are proper constants obtained by expressing $\xi_1$, $\eta_1$, $\zeta_1$
in terms of $\xi$, $\eta$, $\zeta$ and forming similar combinations of the right-hand
members of Equation [33]. The upper signs of Equations
[45] correspond, say, to an intersection of the orbit with $\Pi_1$, the
lower signs with $\Pi_2$.

In Fig. 13 the plane $\Pi$ is shown as the great circle $ABC \ldots HA$
forming the boundary of the visible half of $\Sigma$; the plane given by
Equation [44] as the dashed half circle $AE$; the half planes $\Pi_1$,
$\Pi_2$ as the 180-deg arcs $ABCDE$, $EFGHA$, respectively.

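Suppose that an orbit has reached $\Pi_1$ at $P_0$: $(\xi_{10}, \eta_{10}, \zeta_{10})$ satisfying
Equation [41], say at the time $t = 0$. Equations [45] show
that it is displaced to

$$P_0':\ (\xi_{10} + a,\ \eta_{10} + b,\ \zeta_{10} + c) \tag*{[46]}$$

It now follows from Equations [36] that the resulting motion will
continue along an orbit through $P_0'$

$$\xi_1 = (\xi_{10} + a)e^{\lambda_1 t}, \qquad \eta_1 = (\eta_{10} + b)e^{\lambda_2 t}, \qquad \zeta_1 = (\zeta_{10} + c)e^{\lambda_3 t} \tag*{[47]}$$

until it intersects $\Pi_2$ at $P_1$, or if it never cuts $\Pi_2$, forever converging
to the origin.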

The class of orbits cutting $\Pi_2$ can be obtained from those cutting
$\Pi_1$ by negative reflection through the origin $O$. It follows that a
periodic motion will result if $P_1$ is the negative image of $P_0$, or,
dropping the subscript 0, if for any $\xi_1$, $\eta_1$, $\zeta_1$ satisfying Equation
[43] the following equations hold for $t \geq 0$

$$(\xi_1 + a)e^{\lambda_1 t} = -\xi_1, \qquad (\eta_1 + b)e^{\lambda_2 t} = -\eta_1, \qquad (\zeta_1 + c)e^{\lambda_3 t} = -\zeta_1 \tag*{[48]}$$

If Equations [48], [43] are satisfied for the same $\xi_1$, $\eta_1$, $\zeta_1$ for
several positive $t$, the smallest root is to be used.

Solving Equations [48] for $\xi_1$, $\eta_1$, $\zeta_1$ and substituting in Equation
[43] there results

$$\frac{a}{1 + e^{-\lambda_1 t}} + \frac{b}{1 + e^{-\lambda_2 t}} + \frac{c}{1 + e^{-\lambda_3 t}} = 0 \tag*{[49]}$$

At $t = 0$ the left-hand member of Equation [49] reduces to
$(a + b + c)/2$; for $t \to \infty$, essentially to $ce^{\lambda_3 t}$. Hence if $a + b + c$, $c$
are of opposite sign, roots of Equation [49] will exist. Thus, at
least for negative $1 + (a + b)/c$, there will exist periodic hunting
motions.

Summarizing, it has been shown that for real $\lambda_i$ a periodic
hunting solution exists if Equation [49] has a positive root.
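The first positive root of Equation [49] generally has to be found numerically. The following sketch scans for the first sign change and refines it by bisection; it assumes the left-hand member has the form $\sum_i a_i/(1 + e^{-\lambda_i t})$, and the function names, the scan range `t_max`, and the sample increments are illustrative assumptions only.

```python
import math


def logistic_term(a, lam, t):
    """One term a / (1 + exp(-lam * t)), guarded against overflow."""
    z = -lam * t
    if z > 700.0:  # exp would overflow; the term is negligibly small
        return 0.0
    return a / (1.0 + math.exp(z))


def hunting_function(t, increments, roots):
    """Left-hand member of Equation [49] (three terms) or its n-term analog."""
    return sum(logistic_term(a, lam, t) for a, lam in zip(increments, roots))


def first_positive_root(increments, roots, t_max=50.0, steps=5000, tol=1e-12):
    """Scan (0, t_max] for the first sign change, then bisect. None if no root."""
    t_prev, f_prev = 0.0, hunting_function(0.0, increments, roots)
    for k in range(1, steps + 1):
        t = t_max * k / steps
        f = hunting_function(t, increments, roots)
        if f == 0.0:
            return t
        if f_prev * f < 0.0:
            lo, hi, f_lo = t_prev, t, f_prev
            while hi - lo > tol:
                mid = 0.5 * (lo + hi)
                f_mid = hunting_function(mid, increments, roots)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            return 0.5 * (lo + hi)
        t_prev, f_prev = t, f
    return None


if __name__ == "__main__":
    # a + b + c = 3 and c = -1 are of opposite sign, so a root is expected
    print(first_positive_root((2.0, 2.0, -1.0), (-3.0, -2.0, -1.0)))
```

A scan followed by bisection is used rather than a Newton iteration because the text asks specifically for the smallest positive root, and a fixed scan makes it harder to skip over it; roots closer together than the scan step could still be missed.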

If Equation [49] has positive roots, the first positive root $t = t_1$
is substituted in Equations [48] to yield a point $P_0$ on $\Pi_1$

$$P_0:\ \left[\xi_1 = \xi_{10} = \frac{-a}{1 + e^{-\lambda_1 t_1}},\quad \eta_1 = \eta_{10} = \frac{-b}{1 + e^{-\lambda_2 t_1}},\quad \zeta_1 = \zeta_{10} = \frac{-c}{1 + e^{-\lambda_3 t_1}}\right] \tag*{[50]}$$

Half the periodic orbit is then given by Equations [36] with $A_1 = \xi_{10}$,
$A_2 = \eta_{10}$, $A_3 = \zeta_{10}$ for $-t_1 < t < 0$, while the other half is its
negative image in the origin.

Some motions will approach the periodic motion; others will
converge to the origin.




In the $(\xi, \eta, \zeta)$-space the orbits form a skew image of the orbits
of the $(\xi_1, \eta_1, \zeta_1)$-space described by Equations [36] and the same
statement applies to the translations following intersections with
the skew images of $\Pi_1$, $\Pi_2$, and to the periodic motions.

In the $(x, \theta, \zeta)$-space the orbits are obtained by effecting opposite
translations of the two half spaces to each side of the image of $\Pi$.

It is clear that a similar treatment can be applied to a servosystem
represented by a set of linear differential equations of order
$n$, and subject to lost motion, at least for the case when the
characteristic roots $\lambda_1, \ldots, \lambda_n$ are all real and distinct. One
introduces normal co-ordinates $\xi_1, \ldots, \xi_n$ and calculates their
increments $\pm a_1, \ldots, \pm a_n$ corresponding to "taking up the slack."
The determination of the hunting motion can then be reduced to
the determination of the first positive root of the equation

$$\sum_{i=1}^{n} \frac{a_i}{1 + e^{-\lambda_i t}} = 0 \tag*{[51]}$$

This, of course, includes the case $n = 2$ of Section 3; for this
case the left side of Equation [51] reduces to a sum of two terms.


6 THIRD-ORDER SYSTEM: COMPLEX ROOTS


If the three roots of Equation [34] are not all real, then they
are as shown in Figs. 12(a, b). We proceed to discuss these cases.

As in the case of real roots, it is possible to introduce normal
co-ordinates by means of which the system given by Equations
[31] is reduced to the form of Equations [35]. However, with $\lambda_1$,
$\lambda_2$ complex, as in Equations [27], $\xi_1$, $\eta_1$ will also be conjugate complex
for real solutions. One may avoid this difficulty, as in Section
4, by introducing another set of variables

$$\xi_2 = (\xi_1 + \eta_1)/2, \qquad \eta_2 = (\xi_1 - \eta_1)/2i, \qquad \zeta_2 = \zeta_1 \tag*{[52]}$$

in terms of which the differential equations become real. Equations
[52] yield

$$\xi_1 = \xi_2 + i\eta_2, \qquad \eta_1 = \xi_2 - i\eta_2, \qquad \zeta_1 = \zeta_2 \tag*{[53]}$$

Equations [53] show that one may use up the whole $(\xi_2, \eta_2)$-plane
as the complex Gaussian plane for the complex variable $\xi_1$.
Since $\eta_1$ is the conjugate of $\xi_1$ no separate representation for it is
needed.

Expressing $\xi_1$ in polar co-ordinates

$$\xi_1 = re^{i\varphi} \tag*{[54]}$$

it follows from the Solution [36] that

$$r = |A_1|e^{\mu t}, \qquad \varphi = \arg A_1 + \nu t, \qquad \zeta_2 = A_3 e^{\lambda_3 t} \tag*{[55]}$$

Equations [55] show that the projection of the orbit on the
plane $\zeta_2 = 0$ describes a convergent logarithmic spiral, while its
distance from this plane decreases exponentially.
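A short numerical check of the spiral description may help. The sketch below evaluates the polar form of Equations [55] from the complex exponential and confirms that $\zeta_2 / r^{\lambda_3/\mu}$ stays constant along an orbit; the function name `spiral_orbit` and all numerical values are assumptions chosen for illustration.

```python
import cmath
import math


def spiral_orbit(t, A1, A3, mu, nu, lam3):
    """Return (r, phi, zeta2) at time t for xi1 = A1*exp((mu + i*nu)*t)."""
    xi1 = A1 * cmath.exp(complex(mu, nu) * t)
    zeta2 = A3 * math.exp(lam3 * t)
    return abs(xi1), cmath.phase(xi1), zeta2


if __name__ == "__main__":
    A1, A3, mu, nu, lam3 = 2.0 + 1.0j, 0.5, -0.3, 1.0, -0.6
    for t in (0.0, 1.7):
        r, phi, zeta2 = spiral_orbit(t, A1, A3, mu, nu, lam3)
        # The ratio below is the constant of the surface of rotation
        print(t, r, phi, zeta2 / r ** (lam3 / mu))
```

The printed ratio should be the same at both times, which is the statement that the orbit lies on a surface of the form $\zeta_2 = \text{const}\cdot r^{\lambda_3/\mu}$.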

It is evident that in the $(\xi_2, \eta_2, \zeta_2)$-space the conical surfaces on
which the orbits lie will now have an appearance quite different
from that of Fig. 13 and will wind about the $\zeta_2$-axis.

Elimination of $t$ between $r$ and $\zeta_2$ shows that the orbits lie on
surfaces of rotation

$$\zeta_2 = \text{const}\,\left(r^{\lambda_3/\mu}\right) \tag*{[56]}$$

whose sections by planes $\varphi = \text{const}$ are fractional power parabolas.
If $\lambda_3 > \mu$, these surfaces are tangent to the $\zeta_2$-axis at the
origin $O$ as in Fig. 14; if $\lambda_3 < \mu$ they are normal to the $\zeta_2$-axis at $O$,
as shown in Fig. 15.

By multiplying $\xi_1$, $\eta_1$, $\zeta_1$ by proper constants if necessary, one
may represent $\xi$ in the form

$$\xi = R\left(\frac{\xi_1}{\lambda_1}\right) + \frac{\zeta_2}{\lambda_3} \tag*{[57]}$$

As in Section 5, one now describes the motion by following the
orbits described by Equations [55] with $x$ and $\xi$ increasing until
the orbit either converges to the origin or cuts the locus

$$R(\xi_1) + \zeta_2 = 0, \qquad R(\xi_1\lambda_1) + \lambda_3\zeta_2 < 0; \qquad \Pi_1 \tag*{[58]}$$

whereupon a translation corresponding to the upper signs of

$$\Delta\xi_2 = \pm a, \qquad \Delta\eta_2 = \pm b, \qquad \Delta\zeta_2 = \pm c \tag*{[59]}$$

takes place where $a$, $b$, $c$ are the changes in $\xi_2$, $\eta_2$, $\zeta_2$ corresponding
to the upper signs in Equations [33]. The motion is then followed
with $\xi$ decreasing until it converges to the origin or intersects
the locus

$$\xi_2 + \zeta_2 = 0, \qquad R(\xi_1\lambda_1) + \lambda_3\zeta_2 > 0; \qquad \Pi_2 \tag*{[60]}$$

when a translation with the lower signs of Equations [59] takes
place.

The left-hand member of the inequalities in Equations [58],
[60], can be put in the form

$$\mu\xi_2 - \nu\eta_2 + \lambda_3\zeta_2 \tag*{[61]}$$

and vanishes along a plane, thus showing that $\Pi_1$, $\Pi_2$ are half
planes.


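Fig. 14

Fig. 15

The search of periodic motions leads to the equations

$$(\xi_1 + a + ib)e^{\lambda_1 t} = -\xi_1, \qquad (\zeta_2 + c)e^{\lambda_3 t} = -\zeta_2 \tag*{[62]}$$

for $\xi_1$, $\zeta_2$ satisfying Equation [64] which follows. Solving for $\xi_1$, $\zeta_2$
we obtain

$$\xi_1 = -\frac{a + ib}{1 + e^{-\lambda_1 t}}, \qquad \zeta_2 = -\frac{c}{1 + e^{-\lambda_3 t}} \tag*{[63]}$$

Substituting in

$$R(\xi_1) + \zeta_2 = \xi_2 + \zeta_2 = 0 \tag*{[64]}$$

there results

$$\frac{c}{1 + e^{-\lambda_3 t}} + R\left[\frac{a + ib}{1 + e^{-\lambda_1 t}}\right] = 0 \tag*{[65]}$$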

For $t = 0$, the left-hand member of Equation [65] reduces to
$(c + a)/2$. For large $t$ it becomes essentially

$$ce^{\lambda_3 t} + R\left[(a + ib)e^{\lambda_1 t}\right] = ce^{\lambda_3 t} + e^{\mu t}\left[a\cos\nu t - b\sin\nu t\right] \tag*{[66]}$$

If $\lambda_3 < \mu$ as in Fig. 12(a), the last term in Equation [66] predominates
as $t$ increases and its oscillatory nature shows that
roots of Equation [65] will exist. If $\lambda_3 > \mu$ as in Fig. 12(b), then
the term $ce^{\lambda_3 t}$ predominates for large $t$ and it may happen that no
roots of Equation [65] exist. A more detailed examination of
Equation [65] is then required. Certainly if $a + c$, $c$ are opposite
in sign, roots will exist.

The two cases are shown in Figs. 14 and 15. Fig. 15 corresponds
to $\lambda_3 < \mu$ as in Fig. 12(a). Here the surface given by Equation
[56] on which the orbits lie always cuts the oblique plane Equation
[64] as $\zeta_2$ approaches zero, and roots of Equation [65] will
exist. Fig. 14 corresponds to $\lambda_3 > \mu$ as in Fig. 12(b). Now the
surface Equation [56] and the orbits on it, once they are above
the plane, Equation [64], avoid it as $\zeta_2$ approaches zero. Only if
the translation given by Equations [63] transfers the point from
one side of Equation [64] and $\zeta_2 = 0$ to the other side is a periodic
hunting motion possible.

For the case of Fig. 14, it is evident that once an orbit subtends
an angle less than 45 deg with the positive (or negative) $\zeta_2$-axis
it will approach the origin without ever cutting the oblique plane.

No detailed study of the regions of space analogous to those of
Figs. 7 and 8, constituting the "stable" and the "hunting" regions,
has been made.

Summarizing, it has been shown that hunting periodic motions
certainly exist if $\lambda_3 < \mu$ and may exist even if $\lambda_3 > \mu$, though in
the latter case only for limited initial conditions will the motion
approach these periodic hunting motions. Half of the periodic
motion is obtained by finding the first positive root $t = t_1$ of
Equation [66], substituting it in Equations [62] to solve for $\xi_1$,
$\zeta_2 = \zeta_1$, and substituting the resulting values $\xi_1 = \xi_{10}$, $\zeta_2 = \zeta_{20}$ in

$$\xi_1 = \xi_{10}e^{\lambda_1 t}, \qquad \zeta_2 = \zeta_{20}e^{\lambda_3 t} \tag*{[67]}$$

for $-t_1 < t < 0$. The other half is the negative image in the
origin of the foregoing half.

While the foregoing equations appear to differ considerably
from those of the preceding section, actually they can be made to
agree with them in form. Equation [65] can be replaced by
Equation [51] for $n = 3$, provided $a_1$, $a_2$, $a_3$ denote the increments
in $\xi_1$, $\eta_1$, $\zeta_1$ when $x$ passes through a maximum, and Equation [57]

is replaced by 


For a system of degree $n$ with two or more complex roots, the
determination of the periodic orbits can still be reduced to a solution
of Equation [51] provided $\pm a_i$ denote the increments in the
normal variables $\xi_i$ in taking up the slack, so that a pair of $a_i$
corresponding to conjugate complex $\lambda_i$ are conjugate complex.
Or else a real equation may be obtained by combining conjugate
pairs of the normal variables.


. [68] 


7 LOST MOTION TAKEN UP GRADUALLY


A study has been made of the motion when the slack is taken up
gradually by "coasting" so that in Fig. 3, $bb'$ is replaced by a vertical
line along which $p^2\theta$ (or $p\theta$ for the second-order system) stays
constant. Owing to space limitations this will be omitted except
for stating the results. For $n = 2$ and real $\lambda_1$, $\lambda_2$ the hunting motions
disappear and all orbits tend to rest. For complex $\lambda_1$, $\lambda_2$ the
hunting motion persists, though the constant vector translations
of Fig. 10 are modified.


BIBLIOGRAPHY 


1 "The Effect of Backlash and of Speed-Dependent Friction on
the Stability of Closed-Cycle Control Systems," by A. Tustin, Journal
of The Institution of Electrical Engineers, vol. 94, part IIA, 1947,
pp. 143-151.

2 "Instrument Inaccuracies in Feed-Back Control Systems With
Particular Reference to Backlash," by H. T. Marcy, Morris Yachter,
and Jerome Zauderer, Trans. AIEE, vol. 68, part I, 1949, pp. 778-788.

3 "Backlash in a Velocity Lag Servomechanism," by N. B.
Nichols, Trans. AIEE, vol. 72, part II, 1953, pp. 462-467.

4 "Analysis of Backlash in Feedback Control Systems With One
Degree of Freedom," by L. M. Vallese, AIEE publication "Applications
and Industry," March, 1955, pp. 1-4.




Nonlinear Integral Compensation of a
Velocity-Lag Servomechanism With Backlash


By C. N. SHEN,¹ H. A. MILLER,² and N. B. NICHOLS³


When backlash is encountered in a velocity-lag servo- 
mechanism, the system may become unstable. The loop- 
gain limitation imposed is becoming unacceptable with increasing
requirements for following steep ramp inputs as in
certain machine-tool control applications. Integral com-
pensation is not applicable since, with backlash, oscilla- 
tions result. This paper analyzes an attempt to overcome 
this difficulty by intentionally incorporating a second non- 
linearity, a dead zone in the input to the integrator. Re- 
sults are given which allow the approximate transient re- 
sponse of certain systems to be obtained quickly and the 
question of stability is considered. Finally, an example of 
its application is given along with a discussion of several 
practical factors involved. 


INTRODUCTION 


Various investigators have used linear-system compensation
techniques to obtain acceptable performance from a
number of nonlinear feedback systems. To date, however,
only a few efforts have been reported in utilizing nonlinear
compensation methods in nonlinear systems in an effort to obtain
superior results. McDonald (1)⁴ and others (2 to 5) have worked
on various variable damping systems as well as on the so-called
"dual-mode servo." Sherrard (6) has designed a nonlinear filter
to stabilize an oscillating system while Markusen and Keeler (7)
used a nonlinear filter to discriminate against noise in a nonlinear
system. Truxal (8) discusses nonlinear damping methods with
the aid of the phase plane and Chestnut and Mayer (9) utilize
frequency-response methods to study variable gain and variable
time-constant systems.

The problem of backlash is a common one in servomechanism
practice (10). When it is encountered in a simple velocity-lag
system, which normally is absolutely stable, the system may become
unstable (11). An upper gain limitation is imposed. Even
without backlash the allowable gain is limited by the requirement
of adequate damping. In many practical situations these
limitations are acceptable but with increasing requirements for
following steep ramp inputs (such as is becoming common in
machine-tool control) the system error resulting from the limited
loop gain may be excessive. Integral compensation is applicable
to such a situation when backlash is absent and reduces the
steady-state actuating signal to zero. With the backlash present,
however, an oscillation limit cycle results for any positive (db)
loop gain. This paper analyzes an attempt to overcome this
difficulty by intentionally incorporating a second nonlinearity, a
dead zone in the input to the integrator. The resulting piecewise
linear set of differential equations is studied and the numerical
results are given in a form which allows the approximate transient
response of the system to be conveniently obtained. The question
of stability is considered and, finally, an example of the application
of this compensation method is given along with a discussion
of several practical factors involved.

1 Assistant Professor of Mechanical Engineering, Thayer School of
Engineering, Dartmouth College, Hanover, N. H.

2 Research Staff Member, Raytheon Manufacturing Company,
Waltham, Mass.

3 Manager, Commercial Engineering Department, Raytheon Manufacturing
Company, Waltham, Mass. Mem. ASME.

4 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by Instruments and Regulators Division of The
American Society of Mechanical Engineers and presented at
the ASME-AIEE Conference on Nonlinear Control Systems, Princeton,
N. J., March 26-28, 1956.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
3, 1956. Paper No. 56-IRD-3.


THE NONLINEAR SYSTEM AND EQUATIONS


The nonlinear system to be considered is shown in Fig. 1 and is 
the mathematical model of what is considered a significant 
problem. It is a piecewise linear system with no saturation, 
coulomb friction, or noise effects. Loading and coasting effects at 
$\theta_c$ are neglected. It is a velocity-lag servomechanism with back-
lash and a proportional plus integral amplifier which has a dead
zone of range $+E_d$ to $-E_d$ in series with the integrator input.
The integrator is assumed ideal; i.e., with no input the output re- 
mains constant. A more practical nonideal integrator, such as a 
resistance-capacitance network, would introduce another parame- 
ter and is not considered quantitatively in this paper. 

The motor angle $\theta_m$ for a constant-field armature-controlled
electric motor is related to the manipulated voltage $E_m$ by the
following differential equation

$$T_m \frac{d^2\theta_m}{dt^2} + \frac{d\theta_m}{dt} = K_m E_m \qquad [1]$$

$E_m$ is a function of the actuating signal and depends on the action
of the nonlinear compensating circuit. The integrator output $E_i$ is
given by

$$E_i = \begin{cases}
E_{io}, & |E| < E_d \\[4pt]
\dfrac{1}{T}\displaystyle\int_0^t (E - E_d)\,dt + E_{io}, & E > E_d \\[4pt]
\dfrac{1}{T}\displaystyle\int_0^t (E + E_d)\,dt + E_{io}, & E < -E_d
\end{cases} \qquad [2]$$

where $T$ is the integral time of the integrator, $E_{io}$ the initial value
of the integrator output, as indicated by the second subscript, and
$E_d$ a positive constant which denotes the width of the dead zone.
Then

$$E_m = E + E_i \qquad [3]$$

The relation between $E$ and the actuating signal $\theta_e$ is defined as

$$E = K_v \theta_e \qquad [4]$$

where $K_v$ is a positive constant.
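The dead-zone integrator described above can be sketched in a few lines. This is only an illustrative discretisation, assuming an explicit Euler step; the function and argument names (`dead_zone`, `integrator_step`, `manipulated_voltage`, `e_d`, `t_int`, `dt`) are hypothetical and do not come from the paper.

```python
def dead_zone(e, e_d):
    """Signal passed on to the integrator by a dead zone spanning -e_d..+e_d."""
    if e > e_d:
        return e - e_d
    if e < -e_d:
        return e + e_d
    return 0.0  # inside the dead zone nothing is integrated


def integrator_step(e_i, e, e_d, t_int, dt):
    """Advance the ideal integrator output e_i by one Euler step of length dt.

    t_int is the integral time T; with zero input the output holds its value.
    """
    return e_i + dead_zone(e, e_d) * dt / t_int


def manipulated_voltage(e, e_i):
    """Proportional path plus integral path."""
    return e + e_i
```

With the signal inside the dead zone the integrator output is simply held, which is the property that later gives rise to the offset.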
The effect of backlash is given by Equation [5] and shown
graphically in Fig. 2. The letter $\delta$ represents half the free play.




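As shown, it is assumed that there is no coasting of the output $\theta_c$
during reversals of the motor

$$\frac{d\theta_c}{dt} = \begin{cases}
\dfrac{d\theta_m}{dt}, & |\theta_c - \theta_m| = \delta \\[4pt]
0, & |\theta_c - \theta_m| < \delta
\end{cases} \qquad [5]$$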

DYNAMICS AND LINEAR REGIONS OF THE SYSTEM


The dynamics of the system are complicated by both the back-
lash and the dead zone. Normally there are four possible regions
in each of which a linear differential equation governs the motion.
As is usual in piecewise linear systems the terminal conditions of
one region become the initial conditions of the following region.
These regions are shown in Fig. 3. For convenience the initial
values of the motor angle and its rate of change are defined as

$$\theta_m \Big|_{t=0} = \theta_{mo} \qquad [6]$$

and

$$\frac{d\theta_m}{dt}\Big|_{t=0} = 0 \qquad [7]$$

In region A the motor angle takes up the backlash until
$\theta_m = \theta_{mo} - 2\delta$ while the output angle $\theta_c$ remains constant
at $\theta_{co}$. Then $\theta_m$ leads $\theta_c$ by the amount $\delta$ in regions B, C,
and D. A third-order differential equation applies in region B until
the integrator stops functioning temporarily after $E = -E_d$. Since
there is no integration while $-E_d < E < +E_d$, the motion in region C
is governed by a second-order differential equation and ends when
$E = +E_d$. A third-order differential equation takes over in region
D until the time derivative of the motor angle and hence the
output angle is zero. This completes a half cycle of oscillation; a
similar technique can be repeated for successive half cycles. The
detailed equations are given in normalized form in the Appendix.

Fig. 1  The Nonlinear System With Nonlinear Integral Control

Fig. 2  Backlash Characteristic

Fig. 3  Normal Regions of Operation, Case 1

Complications arise since it is not known beforehand whether 
the system will terminate (d6,,/dt = 0) or pass through a given 
region. An understanding of the behavior is perhaps best ob- 
tained by showing examples for various initial conditions. 

The motor and output angles, as well as the voltage $E$, are shown
versus time in Fig. 4 for a case where $E$ initially lies outside
the integrator dead zone. The backlash is taken up in region A;
the system proceeds through the third-order region B and $\theta_m$ be-
comes a minimum in the second-order region C. Fig. 5 shows
similar curves for another case where $E$ initially lies outside the
integrator dead zone, but because the initial integrator output is
large, $\theta_c$ becomes a minimum in region B. In Fig. 6 the initial
value of $E$ is within the integrator dead zone. The system starts
in region C and terminates in region D. In Fig. 7 the initial value
of $E$ is again within the integrator dead zone but the integrator
output voltage is such that the system both starts and terminates
in region C. This case is completely described by second-order
differential equations. In Fig. 8, $E$ again lies outside the integra-
tor dead zone. The initial value of the integrator output is suf-
ficiently large and of the proper sign to cause the initial accelera-
tion to be positive. This takes up backlash in region A and
terminates in the third-order region B, at a maximum output
angle. The situation in Fig. 9 is similar to that in Fig. 8 except
that $E$ is within the integrator dead zone.

Fig. 4  Regions of Operation, Case 2

Fig. 5  Regions of Operation, Case 3

Fig. 6  Regions of Operation, Case 4

Fig. 8  Regions of Operation, Case 6

There are a few cases where the full amount of backlash cannot 
be taken up and thus a reversal of direction follows. While these 
cases can be analyzed on a full-cycle basis this will not be under- 
taken in this paper. 


Fig. 7  Regions of Operation, Case 5

Fig. 9  Regions of Operation, Case 7


NUMERICAL RESULTS

To describe fully the transient solution of the system of dif-
ferential equations requires a detailed solution for each set of
parameters and for each set of initial conditions. To obtain a 
reasonably simple description of the behavior of this system and 
one which is useful in describing the transients resulting from a 
wide variety of initial conditions, a set of graphs is presented 
which gives the magnitude ratio (final value divided by initial 
value) of the output, the output of the integrator, and the time 
duration all at the moment when the output shaft derivative 
becomes zero. The time duration will be called the half period. 
Thus, by repeated use of these graphs, it is possible to plot the 
maxima and minima of the output function as a function of 
time for many given initial conditions. 

The number of parameters which must be handled has been re- 
duced by choosing the system gain and integrator time in a 
manner to give satisfactory performance for very large actuating 
signals, i.e., when the nonlinearities are insignificant. For such 
a linear system it is known that if $K_v K_m T_m = 1.333$ and $T_m/T =$
0.2778, a damping ratio of $\zeta = 0.316$ will be obtained. These
figures result in a full-cycle magnitude ratio of 0.125 and corre- 
spond to the case where the oscillating term decays at the same 
rate as the nonoscillating term. This is considered to be a reasona- 
ble adjustment of a third-order linear system (12). The compu- 
tations in the remainder of this paper pertain only to the foregoing 
numerical values. 
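The quoted figures can be checked with a short calculation. The sketch below assumes that, for large signals, the loop reduces to the normalised characteristic equation s^3 + s^2 + K1 s + K1 (T_m/T) = 0 with time measured in units of T_m; this form is inferred from the system description rather than stated in this section, and the variable names are illustrative.

```python
import math

K1 = 1.333          # K_v * K_m * T_m, as quoted in the text
TM_OVER_T = 0.2778  # T_m / T, as quoted in the text

# Assumed normalised characteristic equation (time in units of T_m):
#   s^3 + s^2 + K1*s + K1*TM_OVER_T = 0
# Equal decay rates mean it factors as (s + a)(s^2 + 2*a*s + wn^2).
a = 1.0 / 3.0                   # matching the s^2 coefficient: 3a = 1
wn_squared = K1 - 2.0 * a * a   # matching the s coefficient
zeta = a / math.sqrt(wn_squared)
constant_term = a * wn_squared  # should reproduce K1 * TM_OVER_T
full_cycle_ratio = math.exp(-2.0 * math.pi * zeta / math.sqrt(1.0 - zeta ** 2))

print(round(zeta, 3), round(constant_term, 4), round(full_cycle_ratio, 3))
```

Under this assumption the damping ratio comes out at about 0.316, the constant term agrees with K1 (T_m/T) to three figures, and the full-cycle magnitude ratio is about 0.12, close to the 0.125 quoted.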

For convenience, the symbols for the initial and final dimen-
sionless output angle are redefined as

$$\phi_{co} = \phi_{mo} - 1, \qquad \phi_{cf} = \phi_{mf} + 1$$

where the subscript f indicates the final or terminal value at the
end of a half period. Also, the initial and final integrator output
as well as the final output angle are defined in terms of the initial
output angle

$$\beta_o = \frac{A_o}{\phi_{co}} = \frac{A_o}{\phi_{mo} - 1}, \qquad
\gamma = \frac{A_f}{\phi_{co}}, \qquad
\lambda = \frac{\phi_{cf}}{\phi_{co}} = \frac{\phi_{mf} + 1}{\phi_{mo} - 1} \qquad [11]$$

where $A_o$ and $A_f$ are the initial and final dimensionless integrator
outputs.


TABLE 1  TRANSIENT COMPUTATION EXAMPLE; $\eta = 0.2$

  n    $(\lambda)_{n-1}$   $(\gamma)_{n-1}$   $(\phi_{co})_n$
  1                                10.0
  2      -0.60      -0.17      -6.0
  3      -0.37      -0.03       2.3
  4      -0.66      -0.17      -1.5
  5      -0.66      -0.16       0.99
  6      -0.80      -0.23      -0.79
  7      -0.84      -0.25       0.66
  8      -0.87      -0.28      -0.57
  9      -0.88      -0.28       0.50
 10      -0.89      -0.30      -0.45
 11      -0.88      -0.29       0.40
 12      -0.87      -0.29      -0.35
 13      -0.84      -0.27       0.29
 14      -0.74      -0.20      -0.21
 15      -0.20       0.14       0.042
 16      -1.5       -0.70      -0.063
 17       0.24       0.47      -0.015
 18       2.44       2.0       -0.037
 19       0.742      0.82      -0.027


The results presented in Figs. 10, 11, and 12 are for $\eta = 0.2$ and
for several values of $\beta_o$. To illustrate the use of these curves it is
convenient to show an example. Let $\eta = 0.2$, $\phi_{co} = 10.0$, and
$\beta_o = 0$. From the graphs the following is first obtained

$$(\lambda)_1 = -0.60, \qquad (\gamma)_1 = -0.17, \qquad (\sigma)_1 = 3.11$$

The final values of the output angle and the integrator output
at the end of the first half cycle are taken as the initial values of
the second half cycle. Thus

$$(\phi_{co})_2 = (\phi_{cf})_1 = (\lambda)_1 (\phi_{co})_1 = -6.0$$

$$(A_o)_2 = (A_f)_1 = (\gamma)_1 (\phi_{co})_1 = -1.7$$

$$(\beta_o)_2 = \frac{(A_o)_2}{(\phi_{co})_2} = \frac{(\gamma)_1}{(\lambda)_1} = 0.28$$

The numerical subscript indicates the number of the half cycle.

The curves on the graphs are symmetrical with respect to the
ordinate axis. In other words, the curves are applicable to nega-
tive values of $\phi_{co}$ as well as positive. Taking the absolute value
of $(\phi_{co})_2$ as 6.0, and $(\beta_o)_2$ as 0.28, from the graphs we find
$(\lambda)_2 = -0.37$, $(\gamma)_2 = -0.03$, $(\sigma)_2 = 3.40$. Table 1 continues
the computation and the result is shown in Fig. 13.
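The bookkeeping from one half cycle to the next can be written as a small helper. This is only a sketch of the arithmetic in the example above; the values of the magnitude ratio and integrator ratio must still be read from the graphs, and the function name `next_half_cycle` is hypothetical.

```python
def next_half_cycle(phi_co, lam, gamma):
    """Initial conditions of the next half cycle from the graph readings.

    phi_co : initial dimensionless output angle of the current half cycle
    lam    : magnitude ratio read from the graphs for this half cycle
    gamma  : integrator output ratio read from the graphs
    Returns (next phi_co, next A_o, next beta_o).
    """
    phi_next = lam * phi_co        # final output angle becomes the new start
    a_next = gamma * phi_co        # final integrator output becomes the new A_o
    beta_next = a_next / phi_next  # identical to gamma / lam
    return phi_next, a_next, beta_next


# First half cycle of the example: phi_co = 10.0, readings -0.60 and -0.17.
phi2, a2, beta2 = next_half_cycle(10.0, -0.60, -0.17)
```

For the first half cycle this gives an output angle of -6.0, an integrator output of -1.7, and a ratio of about 0.28, matching the values in the text.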


TABLE 1 (continued)

  n    $(A_o)_n$   $(\beta_o)_n$   $\sigma_n$   $\Sigma\sigma_n$
  1      0          0          3.11      3.11
  2     -1.7        0.28       3.40      6.51
  3      0.18       0.081      3.63     10.14
  4     -0.37       0.26       4.13     14.27
  5      0.24       0.24       4.60     18.87
  6     -0.23       0.29       5.14     24.01
  7      0.20       0.30       5.50     29.51
  8     -0.18       0.32       5.92     35.43
  9      0.16       0.32       6.28     41.71
 10     -0.15       0.34       6.72     48.43
 11      0.13       0.33       7.16     55.59
 12     -0.12       0.33       7.83     63.42
 13      0.095      0.32       8.90     72.32
 14     -0.058      0.27      12.40     84.72
 15     -0.029     -0.70
 16     -0.029      0.47
 17     -0.030      2.0
 18     -0.030      0.82
 19     -0.030      1.11

Fig. 10  Magnitude Ratio for $\eta = 0.2$

Fig. 11  Integrator Output Ratio for $\eta = 0.2$

Fig. 12  Period for $\eta = 0.2$

Fig. 13  Comparison of Transient Results for $\eta = 0.2$


SYMMETRICAL LIMIT CYCLE

The system with a sustained oscillation or limit cycle will
have a particular wave shape when the magnitude is plotted ver-
sus time. The shape, the magnitude, and the period will be
constant. Moreover, in the phase space where $\phi_c$, $d\phi_c/d\sigma$, and
$d^2\phi_c/d\sigma^2$ are variables, the limit-cycle curve is symmetrical with
respect to the origin in this space. For a symmetrical system the
magnitude ratio after a half cycle in $\phi_c$ will be $-1$, i.e.

$$\lambda = -1 \qquad [14]$$

Since the shape of the curve remains the same the acceleration
also remains the same after a full period, and after a half cycle the
ratio of the accelerations also will be $-1$. It also can be shown
that the output of the integrator after a half cycle is

$$A_f = -A_o \qquad [15]$$

If this quantity is divided by $\phi_{co}$,

$$\gamma = -\beta_o \qquad [16]$$
For a system with sustained oscillation the two Relations [14]
and [16] must be satisfied. Figs. 14, 15, and 16 show the plots of
$\lambda$, $\gamma$, and $\sigma$ for $\eta = 0.1$ and various $\beta_o$. In Fig. 15 we can locate
the locus of $\gamma = -\beta_o$. This curve is shown on the $\beta_o$ versus $\phi_{co}$
plane in Fig. 17. The curve of $\beta_o$ versus $\phi_{co}$ at $\lambda = -1$ can be ob-
tained from Fig. 14. This is also plotted in Fig. 17.

If sustained oscillation exists, the two curves will intersect. The
corresponding values of $\beta_o$ and $\phi_{co}$ can be read immediately. The
half period $\sigma$ can be determined by these values from Fig. 16.
As an example for $\eta = 0.1$ two intersections are obtained.


Set 1: $\phi_{co} = 0.71$, $\beta_o = 0.31$, and $\sigma = 5.1$
Set 2: $\phi_{co} = 0.128$, $\beta_o = 0.36$, and $\sigma = 14.4$
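The two conditions for a symmetrical limit cycle, a magnitude ratio of -1 and an integrator ratio equal to minus the initial ratio, can be tested on any set of graph readings. The following is a minimal sketch; the function name and the tolerance of 0.02 are arbitrary assumptions, chosen only to reflect the accuracy with which graphs can be read.

```python
def is_symmetrical_limit_cycle(lam, gamma, beta_o, tol=0.02):
    """True when lam is close to -1 and gamma is close to -beta_o."""
    return abs(lam + 1.0) <= tol and abs(gamma + beta_o) <= tol


# Last row of Table 2 (eta = 0.1): essentially on the limit cycle.
on_cycle = is_symmetrical_limit_cycle(-1.01, -0.305, 0.302)

# First half cycle of the eta = 0.2 example: a decaying transient.
decaying = is_symmetrical_limit_cycle(-0.60, -0.17, 0.0)
```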


Fig. 14  Magnitude Ratio for $\eta = 0.1$

Fig. 15  Integrator Output for $\eta = 0.1$


The sustained oscillation occurs at Set 1 with the higher value of
$\phi_{co}$, as shown by the example in Table 2. Although the initial
values, $(\phi_{co})_0 = 0.13$ and $(\beta_o)_0 = 0.30$, are very close to Set 2,
oscillations are sustained at Set 1.


TABLE 2  CONVERGENCE OF THE OSCILLATION

 $(\lambda)_{n-1}$   $(\gamma)_{n-1}$   $(\phi_{co})_n$   $(\beta_o)_n$
                           0.13       0.30
  -1.08      -0.42       -0.1405     0.389
  -1.36      -0.54        0.288      0.397
  -1.31      -0.49       -0.377      0.374
  -1.24      -0.44        0.467      0.355
  -1.16      -0.38       -0.542      0.328
  -1.09      -0.355       0.591      0.326
  -1.07      -0.335      -0.633      0.313
  -1.04      -0.320       0.659      0.308
  -1.03      -0.315      -0.679      0.306
  -1.02      -0.305       0.692      0.300
  -1.02      -0.310      -0.705      0.304
  -1.01      -0.305       0.712      0.302

Fig. 16  Period for $\eta = 0.1$

Fig. 17  ... of Limit Cycle for $\eta = 0.1$ (curves: locus of $\beta_o = -\gamma$; values of $\beta_o$ at $\lambda = -1$)


STABILITY

One of the important factors in selecting a compensation sys-
tem for a feedback control system is that of relative stability.
It is not sufficient merely to assure that the system is stable in a
mathematical sense, but it also must possess adequate damping.
In examining transients of the system under study it has been
observed that $\beta_o$ is relatively constant for a few cycles in almost
every case for medium signals. This is true for a region of the
output shaft angle and it is possible to determine a maximum
value of the magnitude ratio $\lambda$ for a truly constant $\beta_o$. When this
is done it seems reasonable to use this maximum magnitude ratio
as an index of stability. With this assumption, successive half
cycles of the transient decrease by at least the value of the maxi-
mum magnitude ratio each half cycle. Thus, if this ratio is less
than unity, the system is stable, and if it is much less than unity it
is adequately damped. When this calculation is carried out for
various values of the dead zone $\eta$ it is seen that for adequate sta-
bility in a great many systems, with the gain parameters chosen
in this paper, $\eta$ must be greater than 0.5. For $\eta = 0.5$,
the maximum magnitude ratio is 0.58.
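If the maximum magnitude ratio is taken as a bound on the decay per half cycle, the number of half cycles needed for a transient to fall below a given fraction of its initial size follows directly. The sketch below assumes the bound holds for every half cycle, which the text supports only for medium signals; the function name and the 5 per cent target are illustrative.

```python
import math


def half_cycles_to_decay(max_ratio, fraction):
    """Smallest n with max_ratio**n <= fraction, for 0 < max_ratio < 1."""
    if not 0.0 < max_ratio < 1.0:
        raise ValueError("max_ratio must lie strictly between 0 and 1")
    if not 0.0 < fraction < 1.0:
        raise ValueError("fraction must lie strictly between 0 and 1")
    return math.ceil(math.log(fraction) / math.log(max_ratio))


# With the ratio of 0.58 quoted for a dead zone parameter of 0.5:
n_half_cycles = half_cycles_to_decay(0.58, 0.05)
```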


THE OFFSET PROBLEM


Careful examination of Table 1 will reveal that as time in-
creases the integrator output finally achieves a constant value.
This is shown in Table 1 in the column labeled $A_o$, and the terminal
value is -0.030. This is called offset and will correspond to a final
output shaft angle different from zero.5 The magnitude of the
offset is the value of the integrator output when $E/K_v\delta$ enters the
region between $+\eta$ and $-\eta$ without subsequently going out of
this region. With an ideal integrator the output remains at this
value.

If the offset is greater than $\eta$, eventually it will cause $E/K_v\delta$ to
exceed $\eta$ and this will reduce the integrator output. Hence the
maximum offset is $\eta$. Also, if the initial conditions of the system
are


do 
lone — 1 =0, = 1.0 


the system will be at rest and the backlash will not be taken up in
finite time. This is equivalent to saying that the final offset mag-
nitude can be any value between $+\eta$ and $-\eta$.

Unfortunately, a system with a high value of $\eta$ will have a wide
range of offset. In this regard alone, $\eta$ should be chosen as small as
possible. Another possible solution to the offset problem is to
use an integrator which will not retain its output indefinitely 
without an input signal. Thus, if a resistance-capacitance circuit 
is used as an integrator, the offset eventually will become zero. 
Presumably a relatively long decay time would be used, but even 
in this circumstance it is not certain that the results of the analy- 
sis presented in this paper apply. The analysis should be re- 
examined with this alternative integrator in mind to determine 
the possible effects on the results of the analysis. 


EXAMPLE OF APPLICATION 


In applying this compensation method to a practical situation,
the system is first adjusted for the most satisfactory operation
without the nonlinear compensator since for small signals the com-
plete system operates in this mode. Thus, for a given $T_m$, with
the backlash reduced to as small a value as practicable, $K_v K_m$ is
set. $K_v K_m$ is the velocity error constant and, as shown by Nichols
(11), the product $K_v K_m T_m$ must be less than 3.046 for stability.
On the basis of the previous assumptions, $T$ is already determined
by the large-signal performance and $\eta$ is set for acceptable damp-
ing in the medium-signal area.

5 This is a result which would not be predicted by describing func-
tion analysis methods (13).

In considering a single-motion machine-tool contouring control,
experience indicates that the backlash in almost all machine tools
can be reduced to 0.015 in. and for this situation a velocity-error
constant of 20 sec$^{-1}$ is a typical maximum. Such machines are
currently desired to follow the equivalent of a ramp input of 50
ipm. Thus the steady-state error would be 50/(60 × 20) = 0.042
in. With the nonlinear compensation adjusted for $\eta = 0.5$, the
steady-state error would be reduced to at most 0.0075 in. or 18
per cent of the value without the compensation. The steady-state
error for zero slope inputs is scattered within ±0.0075 in. if the
ideal integrator is used, but with a modified integrator it seems
likely that this impairment can be reduced to zero.

Looked at in another way the new system is desirable for all
ramp inputs exceeding $60 K_v K_m \eta \delta$ ipm, if the observed stability is
acceptable. For the above case, this is 9.0 ipm.
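The arithmetic of this example is easy to reproduce. The sketch below uses only the numbers quoted above; the variable names are illustrative, and the break-even expression assumes that the constant in it is the velocity-error constant of 20 per second.

```python
ramp_ipm = 50.0         # ramp input, inches per minute
velocity_const = 20.0   # velocity-error constant, 1/sec
eta = 0.5               # dead zone parameter
backlash_in = 0.015     # backlash figure quoted in the text, inches

# Steady-state following error without the nonlinear compensation.
error_uncompensated = ramp_ipm / (60.0 * velocity_const)

# Largest steady-state error with the compensation.
error_compensated = eta * backlash_in

# Fraction of the uncompensated error that remains.
remaining_fraction = error_compensated / error_uncompensated

# Ramp rate above which the compensated system gives the smaller error.
break_even_ipm = 60.0 * velocity_const * eta * backlash_in
```

These expressions return about 0.042 in., 0.0075 in., 18 per cent, and 9.0 ipm, in agreement with the text.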


CONCLUSIONS


In certain circumstances with relatively large values of back- 
lash and where it is necessary to follow a ramp input, this system 
may prove advantageous. The steady-state following error is 
reduced but an offset is introduced. Means are indicated which 
may reduce this impairment. With this method the steady-state 
system error is no more than half the backlash for step and ramp 
reference inputs. 


BIBLIOGRAPHY 


1 “Nonlinear Techniques for Improving Servo Performance,”
by D. McDonald, Proceedings of National Electronic Conference, vol.
6, 1950, pp. 400-421.

2 “A Topological and Analogue Computer Study of Certain
Servomechanisms Employing Nonlinear Electronic Components,” by
R. C. Lathrop, PhD thesis, University of Wisconsin, 1951.

3 “The Use of Nonlinear Feedback to Improve the Transient
Response of a Servomechanism,” by J. B. Lewis, Trans. AIEE, vol.
71, part 2, 1952, pp. 449-453.

4 “Optimization of Nonlinear Control Systems by Means of
Nonlinear Feedbacks,” by R. S. Nieswander and R. H. MacNiel,
Trans. AIEE, vol. 72, part 2, 1953, pp. 262-272.

5 “A Differential-Analyzer Study of Certain Nonlinearly
Damped Servomechanisms,” by R. R. Caldwell and V. C. Rideout,
Trans. AIEE, vol. 72, part 2, 1953, pp. 166-170.

6 “Stabilization of a Servomechanism Subject to a Large Ampli-
tude Oscillation,” by E. S. Sherrard, Trans. AIEE, vol. 71, part 2,
1952, pp. 312-324.

7 “A Noise Adaptive Flight Path Control System,” by D. L.
Markusen and R. J. Keeler, The Second Feedback Control Systems
Conference, AIEE, No. 8-63, 1954, pp. 115-122.

8 “Automatic Feedback Control System Synthesis,” by J. G.
Truxal, McGraw-Hill Book Company, Inc., New York, N. Y., first
edition, 1955, pp. 653-663.

9 “Servomechanisms and Regulating System Design,” by
H. Chestnut and R. W. Mayer, John Wiley & Sons, Inc., New York,
N. Y., vol. 2, first edition, 1955, pp. 345-364.

10 “The Effect of Backlash and of Speed-Dependent Friction
on the Stability of Closed Cycle Control Systems,” by A. Tustin,
Journal of the Institution of Electrical Engineers, vol. 94, part 2A,
1947, pp. 143-151.

11 “Backlash in a Velocity-Lag Servomechanism,” by N. B.
Nichols, Trans. AIEE, vol. 72, part 2, 1953, pp. 462 and 466.

12 “Theory of Servomechanisms,” by H. M. James, N. B. Nichols,
and R. S. Phillips, McGraw-Hill Book Company, Inc., New York,
N. Y., first edition, 1947, pp. 154-157.

13 “Recent Advances in Nonlinear Servo Theory,” by J. M. Loeb,
Trans. ASME, vol. 76, 1954, pp. 1281-1289 (see author's closure).




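Appendix


The differential equations for each region with initial and final
conditions are given for a normal half cycle that includes four
regions, Fig. 3. The variables have been nondimensionalized.

Denote

$$\phi_m = \frac{\theta_m}{\delta}, \quad \phi_c = \frac{\theta_c}{\delta}, \quad \sigma = \frac{t}{T_m}, \quad A = \frac{E_i}{K_v \delta}, \quad \eta = \frac{E_d}{K_v \delta}, \quad K_1 = K_v K_m T_m$$


Region A, Second Order

$$\frac{d^2\phi_m}{d\sigma^2} + \frac{d\phi_m}{d\sigma} = K_1 \left\{ A_o - (\phi_{mo} - 1) + \frac{T_m}{T}\,\sigma\,\left[ \eta - (\phi_{mo} - 1) \right] \right\} \qquad [18]$$

At $\sigma = 0$: $\phi_m = \phi_{mo}$, $\dfrac{d\phi_m}{d\sigma} = 0$

At $\sigma = \sigma_1$: $\phi_m = \phi_{mo} - 2$, with $\dfrac{d\phi_m}{d\sigma}$ continuous $\qquad [20]$


Region B, Third Order

$$\frac{d^3\phi_m}{d\sigma^3} + \frac{d^2\phi_m}{d\sigma^2} + K_1 \frac{d\phi_m}{d\sigma} + K_1 \frac{T_m}{T}\,\phi_m = K_1 \frac{T_m}{T}\,(\eta - 1) \qquad [21]$$

At $\sigma = \sigma_1$: $\phi_m = \phi_{mo} - 2$, with $\dfrac{d\phi_m}{d\sigma}$ and $\dfrac{d^2\phi_m}{d\sigma^2}$ continuous

At $\sigma = \sigma_2$: $\phi_m = \eta - 1$


Region C, Second Order

$$\frac{d^2\phi_m}{d\sigma^2} + \frac{d\phi_m}{d\sigma} + K_1 \phi_m = K_1 (A_2 - 1) \qquad [24]$$

$$A_2 = A_o + \frac{T_m}{T} \left[ \sigma_1 (\eta - \phi_{mo} + 1) + \int_{\sigma_1}^{\sigma_2} (\eta - \phi_m - 1)\, d\sigma \right] \qquad [25]$$

At $\sigma = \sigma_2$: $\phi_m = \eta - 1$, with $\dfrac{d\phi_m}{d\sigma}$ continuous $\qquad [26]$

At $\sigma = \sigma_3$: $\phi_m = -\eta - 1$


Region D, Third Order

$$\frac{d^3\phi_m}{d\sigma^3} + \frac{d^2\phi_m}{d\sigma^2} + K_1 \frac{d\phi_m}{d\sigma} + K_1 \frac{T_m}{T}\,\phi_m = K_1 \frac{T_m}{T}\,(-\eta - 1) \qquad [28]$$

At $\sigma = \sigma_3$: $\phi_m = -\eta - 1$, with $\dfrac{d\phi_m}{d\sigma}$ and $\dfrac{d^2\phi_m}{d\sigma^2}$ continuous

At the end of the half cycle: $\dfrac{d\phi_m}{d\sigma} = 0 \qquad [30]$

If $d\phi_m/d\sigma = 0$ at any time prior to entering region D, the prob-
lem is not “normal” and the system behavior is described in the
section on System Dynamics.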
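The piecewise equations of this Appendix can also be integrated numerically instead of region by region. The sketch below is not the authors' method: it assumes a zero reference input, the normalisation written above, a simple semi-implicit Euler step, and a play operator for the backlash. All function and variable names are hypothetical.

```python
K1 = 1.333          # K_v * K_m * T_m (value used in the paper)
TM_OVER_T = 0.2778  # T_m / T (value used in the paper)


def dead_zone(u, eta):
    """Dimensionless integrator input for a dead zone spanning -eta..+eta."""
    if u > eta:
        return u - eta
    if u < -eta:
        return u + eta
    return 0.0


def half_cycle(phi_co, beta_o, eta, dt=1e-4, sigma_max=200.0):
    """Integrate one half cycle; return (magnitude ratio, integrator ratio, sigma)."""
    if phi_co == 0.0:
        raise ValueError("phi_co must be nonzero")
    side = 1.0 if phi_co > 0.0 else -1.0
    phi_c = phi_co
    phi_m = phi_co + side    # motor rests against the side it last drove
    vel = 0.0                # d(phi_m)/d(sigma); zero at the start of a half cycle
    a_int = beta_o * phi_co  # dimensionless integrator output A
    sigma = 0.0
    while sigma < sigma_max:
        # Assumed motor equation: phi_m'' + phi_m' = K1 * (A - phi_c)
        acc = K1 * (a_int - phi_c) - vel
        new_vel = vel + acc * dt
        if vel != 0.0 and new_vel * vel <= 0.0:
            break            # motor velocity has returned to zero
        vel = new_vel
        phi_m += vel * dt
        # Backlash: the output moves only when the play of +/-1 is taken up.
        phi_c = min(max(phi_c, phi_m - 1.0), phi_m + 1.0)
        # Integrator acts on the error signal -phi_c through the dead zone.
        a_int += TM_OVER_T * dead_zone(-phi_c, eta) * dt
        sigma += dt
    return phi_c / phi_co, a_int / phi_co, sigma


lam, gamma, sigma = half_cycle(10.0, 0.0, 0.2)
```

For the starting values of the transient example (output angle 10.0, zero integrator output, dead zone parameter 0.2), the result can be compared with the graph readings of -0.60, -0.17, and 3.11 given in the text. Agreement should be expected only to within the accuracy of the graphs and of the assumptions stated here.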


Coefficients for the annular orifice formed between a 
circular disk and a cylindrical tube are reported for twenty- 
one orifices having disk diameter to tube diameter ratios 
in the range of 0.95 to 0.996, and orifice length-to-width 
ratios from 0.118 to 33.3. The orifice Reynolds-number 
range is from 2.0 to 20,000 for both tangent and concentric 
orientations of the disk. Comparison with previous data 
indicates that the results also apply to the annular orifice 
formed when a rod extends through a circular hole in a 
plate. Theoretical and semi-empirical equations are de- 
veloped to predict coefficients for annular orifices. 


NOMENCLATURE 


The following nomenclature is used in the paper: 


3 
B = orifice geometry constant, B = 1 + y Z 


C = over-all annular orifice coefficient, Equation [1]

C' = point value of the annular-orifice coefficient at a given
angle $\theta$ in a tangent orifice

coefficient of stream contraction in an orifice, Equation 
[10] 

= orifice coefficient for an eccentric annular orifice 

inside diameter of shell or outside diameter of an annular 
orifice, ft 

hydraulic diameter of an annular orifice, ft, D, = D—d 

= fraction of maximum pressure recovery due to stream 

expansion from the vena contracta to full annulus area 

actually recovered in a given orifice 


= 


G = mass flow rate through an orifice, lb/(hr) (sq ft) 
K = number of velocity heads pressure drop due to kinetic 
energy effects in the viscous-flow regime, Fig. 9 
L = disk thickness, hence, orifice length, ft 
P = pressure, psf 
AP, = pressure drop due to friction in the annulus, psf 
AP, = total pressure drop across an annular orifice, psf 
R = outer radius of annular orifice, ft 
Re = annular orifice Reynolds number, Equation [2] 
Re' = point value of Re at angle $\theta$ in a tangent annular orifice,
Equation [31]
S = cross-sectional area of an annular orifice, sq ft
S_vc = cross-sectional area of stream at vena contracta, sq ft
V = velocity through annulus, fph
W = weight rate of flow, lb/hr
Z = concentric annulus length-to-width ratio, Equation [3]
Z' = point value of Z at angle $\theta$ in a tangent annular orifice,


Equation [24] 


1 Hanford Atomic Products Operation, General Electric Company,
Richland, Wash.; formerly, Research Fellow in Chemical Engineer- 
ing, Department of Chemical Engineering, University of Delaware, 
Newark, Del. 

2 Professor of Chemical Engineering, University of Delaware,
Newark, Del. Mem. ASME. 


Contributed by the Heat Transfer Division and presented at the 
Spring Meeting, Portland, Ore., March 18-21, 1956, of THE AMERICAN
SOCIETY OF MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, August 10, 
1955. Paper No. 56—S-22. 


Flow Through Annular Orifices 


By K. J. BELL1 AND O. P. BERGELIN2


a = clearance between disk and shell at a given angle $\theta$ in a
tangent annular orifice, ft, Equation [22]

outside diameter of annular-orifice disk or inside diame- 
ter of annular orifice, ft 

friction factor for flow between parallel plates of infinite 
width 


pAP 
= 


conversion constant,
4.17 × 10^8 (ft) (lb-mass) / (hr^2) (lb-force)

= inner radius of an annular orifice, ft 

= distance from center of disk to a given point on inner 
perimeter of a tangent annular orifice, ft, Equation 
[18a] 

distance from center of disk to a given point on outer 
perimeter of a tangent annular orifice, ft, Equation 
[19] 

eccentricity of an annular orifice. $\epsilon$ is equal to the dis-
tance between centers of shell and disk divided by dif-
ference in radii of shell and disk. Therefore, $\epsilon = 0$ for
a concentric orifice and $\epsilon = 1$ for a tangent orifice

angle of displacement of a given point on periphery of a
tangent orifice from point of greatest clearance, ra-
dians

fluid absolute viscosity, lb/(ft) (hr) 

numerical constant, 3.14159 

fluid density, pcf 

mathematical symbol of summation 


= 


My AE 
da 


INTRODUCTION 


In the design of baffled shell-and-tube heat exchangers, me- 
chanical considerations require that there be clearances between 
the shell and the baffles and between the tubes and the baffles. 
The annular orifices thus formed permit a part of the shell-side 
fluid to leak past the baffle, decreasing both the shell-side pressure 
drop and the heat-transfer coefficient. Therefore, it is desirable 
to know the annular orifice coefficient as a function of the orifice 
geometry and Reynolds number in order to allow for internal 
leakage during exchanger design. These coefficients are also ap- 
plicable wherever annular clearances are encountered in other 
types of equipment; i.e., for housings around shafts and for 
labyrinth seals. As part of the Cooperative Research Program 
on Heat Exchangers at the University of Delaware, an experi- 
mental and analytical investigation of annular orifice coefficients 
was made. The results of this study are summarized here. 


EXPERIMENTAL EQUIPMENT AND PROCEDURE 


The experimental apparatus consisted of a cylindrical shell, 
several disks of various thicknesses and diameters, disk-support 
rods, an end plate, an approach header, manometers, thermo- 
couples, and two separate fluid pumping and metering systems for 
the oil and the water. Figs. 1, 2, 3, and 4 show the orifice con- 
struction and the equipment layout. In operation the fluid was 
pumped through the orifice and the flow rate and pressure drop 
across the orifice were measured after both flow and thermal 
equilibrium were attained. The results expressed in terms of 
orifice coefficient, orifice Reynolds number, and orifice length-to- 
width ratio are shown in Figs. 5 through 8. The equations de-
fining these terms are

$$C = \frac{W}{S\,(2 \rho g_c \Delta P_t)^{1/2}} \qquad [1]$$

$$Re = \frac{(D - d)\,G}{\mu} \qquad [2]$$

$$Z = \frac{2L}{D - d} \qquad [3]$$

Details of the apparatus and procedures are presented in the Ap-
pendix. The dimensions of the orifices which were tested are
given in Table 1.

Fig. 1  Diagram of Shell
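The three defining relations, orifice coefficient, Reynolds number, and length-to-width ratio, translate directly into code. This is a minimal sketch in the units of the Nomenclature (ft, hr, lb, psf); the function names are hypothetical, and the mass velocity G is assumed to be the weight rate of flow divided by the annular area.

```python
import math

G_C = 4.17e8  # conversion constant, (ft)(lb-mass)/(hr^2)(lb-force)


def orifice_coefficient(w, s, rho, dp_t):
    """Over-all coefficient C from flow rate w (lb/hr), area s (sq ft),
    density rho (pcf) and total pressure drop dp_t (psf)."""
    return w / (s * math.sqrt(2.0 * rho * G_C * dp_t))


def reynolds_number(d_outer, d_inner, g, mu):
    """Re based on the diameter difference D - d (ft), mass velocity g
    (lb/(hr)(sq ft)) and viscosity mu (lb/(ft)(hr))."""
    return (d_outer - d_inner) * g / mu


def length_to_width_ratio(length, d_outer, d_inner):
    """Z: disk thickness divided by the radial clearance (D - d)/2."""
    return 2.0 * length / (d_outer - d_inner)
```

A quick consistency check is to compute the flow rate that corresponds to a chosen coefficient and confirm that `orifice_coefficient` returns the same value.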


THEORETICAL ANALYSIS AND EXPERIMENTAL RESULTS


In general, the orifice coefficient is a function of the orifice di-
mensions, the orifice Reynolds number, and the orientation of
disk to shell; i.e., concentric or tangent mounting. The method
of analysis which was used was to postulate a mechanism of flow
for each general case to be considered and then to separate the
over-all mechanism into individual phenomena acting simul-
taneously. Each individual phenomenon was analyzed in the
light of established principles of fluid mechanics, using rigorous
methods where possible and empiricism where necessary. Finally,
the results of the individual analyses were combined to predict the
over-all orifice coefficient.

Space does not permit the details of the analyses (1)3 to be
shown. Hence, only the postulated mechanism and the final
equation are given for each case.

Fig. 2  Mounting-Rod Details

TABLE 1  ORIFICE DIMENSIONS
(Only the following entries of this table are legible.)
Reynolds-number ranges: 4-22,000; 428-6560; 7100-24,300; 18-21,200;
6-21,000; 6-21,000; 6-280; 96-3200; 55-4000; 7000-16,000; 2-16,600;
4-20,500; 2-7300; 2-37; 54-5500; 3-17,300; 4-17,700
Other entries: 0.01284; 0.01283; 0.00589; 0.005885; 0.00304
* Tangent, T; concentric, C.
NOTE: Shell diameter was 5.2541 in. for all tests. Orifice profiles are shown in Fig. 2.

3 Numbers in parentheses refer to the Bibliography at the end of the paper.

Fig. 3  Test Section, Headers Removed

In the following discussion, flow ranges are designated as viscous,
turbulent, or transition, based on the Reynolds-number ranges as
follows:

1 The viscous-flow range refers to Reynolds numbers less than
40, where the predominant effect is energy loss by viscous shear in
the fluid, and the kinetic effects are confined to the acceleration of
the fluid as it flows into the orifice.


2 The turbulent-flow range refers to Reynolds numbers above 
4000, where the predominant effects are the kinetic-energy losses 
associated with stream acceleration, contraction, limited expan- 
sion, and turbulent friction. 

3 The transition-flow range refers to Reynolds numbers be- 
tween 40 and 4000, where both kinetic and viscous phenomena are 
important. 


There are no sharp demarcations between the flow ranges, and 
the foregoing division is arbitrarily based on the limits of applica- 
bility of the derived equations, as shown by the experiments. 
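The three flow ranges can be expressed as a small classifier. Since the text notes that the demarcations are not sharp, the treatment of the boundary values below (40 and 4000 counted as transition) is an arbitrary assumption, and the function name is hypothetical.

```python
def flow_range(reynolds):
    """Classify an annular-orifice Reynolds number into the paper's flow ranges."""
    if reynolds < 0.0:
        raise ValueError("Reynolds number cannot be negative")
    if reynolds < 40.0:
        return "viscous"
    if reynolds <= 4000.0:
        return "transition"
    return "turbulent"
```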

TABLE 2 FLOW RANGES DISCUSSED 


Type of orifice Viscous Turbulent Transition 


Concentric, sharp edge. . . D 
Concentric, thick, square edge : E I 
Concentric, thick, round edge - — F J 
Tangent, sharp edge.......... K 
Tangent, thick, square edge. . ; G,H K 


Eleven separate cases, A through K, are discussed in the follow- 
ing sections. Table 2 serves as an index to the discussion and 
outlines its general scope. For each case the theoretical analysis 
is discussed first and, where possible, a suitable equation is de- 
veloped. The experimental results are then presented and com- 
pared with the analysis. The measured coefficients for two typi- 
cal orifices, 5.02-S and 5.20-1, are given in Figs. 5 and 6, and 
smoothed cross plots of all the data are given in Figs. 7 and 8. 

(A) Viscous Range, Concentric Orifice. The mechanism of flow 
through an annular orifice in the viscous regime is assumed to 
consist of six steps: 


1 Viscous deformation of the flow profile immediately up- 
stream from the orifice. 

2 Acceleration of the stream to a uniform velocity at the 
orifice entrance. 

3 Development of a parabolic velocity distribution. The
theoretical terminal-velocity distribution for viscous flow in an
annulus is not exactly parabolic, but for the orifices tested, the
deviation from a parabolic distribution is negligible.

4 Viscous flow with friction losses through length of orifice.

5 Dissipation of the kinetic energy of the stream downstream
from the orifice.

6 Downstream viscous deformation of the stream.

Fig. 4  Apparatus Assembled for Normal Approach Run

The equation for this case can be shown to be


where K is taken from Fig. 9. The first term on the right side
of Equation [4] gives the effect of steps 1 and 6, the second term
allows (2) for step 4, and the last term, which is negligible
for Re < 4, represents the effect of steps 2, 3, and 5. Equation [4]
is also applicable to an orifice whose disk has a rounded leading
edge, if the rounded length is neglected in calculating the length-
to-width ratio Z.

For Reynolds numbers below 10, theory and experiment agree 
within about 2 per cent, except for a few cases in which there was 
considerable difficulty in centering the disk in the shell. It is be- 
lieved that the theory is exact in this range and that the error
was due to eccentricity of mounting. An approximate equation,
the exactness of which increases with increasing Z, for the predic- 
tion of the effect of eccentricity is (4) 


For 10 < Re < 40, the theoretically predicted coefficients fall 
up to 10 per cent under the experimental results for reasons 
presently unknown. In this range it is recommended that the 
experimental results be used with due regard to possible ec- 
centricity. 

(B) Viscous Range, Tangent, Sharp-Edge Orifice. The anal- 
ysis of the annular orifice formed when the disk is tangent to 
the shell is somewhat more complex than the concentric-orifice 
analysis because the clearance between disk and shell varies from 
(D − d) to zero. Consequently, the point values of Re, Z, and C
vary. The basic assumption made in the analysis of the tangent 
orifice is that the orifice coefficient at a given point along the 
periphery of a tangent orifice is the same as the coefficient of a 
concentric orifice having the same values of Re and Z as exist at 
the given point. 

Fig. 9 Kinetic Term for Viscous Flow in Annular Orifice


The problem is then one of expressing the geometric variables
as functions of position along the periphery relative to the point
of tangency, expressing the orifice coefficient as a function of these
geometric variables, and integrating the resulting expression over
the entire orifice circumference. The detailed treatment of the
case of a tangent orifice having Z equal to zero, assuming negligible
kinetic effects (K < 64/Re), is given in the Appendix. The
resulting equation is

$$\frac{1}{C^2} = \frac{64}{Re}\,\frac{R + r}{2R + r} \cong \frac{128}{3\,Re} \qquad [6]$$


It has been found that the use of K from Fig. 9 gives fair agreement
with the data for Re < 40 for tangent orifices, and so it may
be used to calculate orifice coefficients. For the case of Z = 0 and
hence K = 1, the recommended equation is

$$\frac{1}{C^2} = \frac{64}{Re}\,\frac{R + r}{2R + r} + 1 \cong \frac{128}{3\,Re} + 1 \qquad [7]$$

where the symbol $\cong$ means “approximately.”

It may be seen from Fig. 5 that Equation [7], which is a form
of Equation [9], and the experimental results agree within about 1
per cent for Re < 25. For Re > 25, experimental data should be
used.
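
As a rough numerical illustration of Case B, the following Python sketch evaluates the coefficient from the approximate viscous relation C² = 3Re/128 (Equation [40] of the Appendix) with the kinetic term K added to 1/C², as the text describes. The function name and the default K = 1 are illustrative choices, not part of the original paper, and the result should be trusted only in the range Re < 25 stated above.

```python
import math


def tangent_sharp_edge_coefficient(re: float, k: float = 1.0) -> float:
    """Approximate C for a tangent sharp-edge orifice in viscous flow.

    Assumes 1/C**2 = 128/(3*Re) + K, i.e. Equation [40] of the Appendix
    plus the empirical kinetic term K (K = 1 for Z = 0, per the text).
    """
    if re <= 0.0:
        raise ValueError("Reynolds number must be positive")
    inv_c_squared = 128.0 / (3.0 * re) + k
    return 1.0 / math.sqrt(inv_c_squared)


if __name__ == "__main__":
    # Tabulate C over the range where the relation is said to hold.
    for re in (1.0, 5.0, 10.0, 25.0):
        c = tangent_sharp_edge_coefficient(re)
        print(f"Re = {re:5.1f}  C = {c:.4f}")
```

Setting k = 0 recovers the purely viscous limit of Equation [40], which is a convenient check on the arithmetic.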

(C) Viscous Range, Tangent, Thick Orifice. The same method 
of analysis as used in Case B may be used for an orifice having Z 
not equal to zero, again neglecting kinetic effects. The resulting 
expression for the orifice coefficient is 


[Equation [8]: expression illegible in the source copy]

where B = 1 + Z, and Z ≠ 0.


It is possible to arrive at a simpler, but not exact, expression for 
thick tangent-orifice coefficients by assuming that the expressions 
for stream-deformation losses and channel-friction losses may be 
independently integrated and then combined. In this form it is 
also convenient to include K as an empirical correction factor for 
kinetic effects. The result is 


[Equation [9], given in two alternative forms: expressions illegible
in the source copy]

The maximum difference between Equation [8] and either form 
of Equation [9] is 0.8 per cent at Z = 4/3. 

Equation [9] predicts experimental results within about 2 per 
cent for Re < 40. As Z increases, the range of validity of Equa- 
tion [9] increases. Hence, Fig. 6 shows that for Z = 33.3, Equa- 
tion [9] is valid to Re = 100. 

(D) Turbulent Range, Concentric, Sharp-Edge Orifice. For turbulent
flow through a sharp-edge orifice, the stream continues to
contract and accelerate downstream until a minimum area $S_{vc}$ is
reached at the vena contracta. The stream then expands and decelerates,
and the kinetic energy is dissipated as friction. For the
orifices studied, the downstream area is very much greater than
$S_{vc}$, the pressure recovery is negligible, and the total pressure loss
is equal to that required to accelerate the fluid to the vena contracta
plus any entrance-friction losses. Entrance-friction losses
are generally considered to be not over 1 to 2 per cent of the gain in
stream kinetic energy, so they are conveniently included in the
acceleration pressure drop. Therefore, with small error

$$C = C_c = S_{vc}/S \qquad [10]$$

The value of $C_c$ is taken from Fig. 5, a plot of the coefficient for
orifice 5.02-S. In the turbulent-flow regime, orifice 5.02-S behaves
as a sharp-edge orifice even though it has an edge of finite,
but small, thickness.

No quantitative theory was developed for this case. Values for 
C should be taken from Fig. 5. 

(E) Turbulent Range, Concentric, Thick, Square-Edge Orifice. 
For the case of the thick orifice in the turbulent regime, two 
phenomena, in addition to the initial acceleration and contrac- 
tion, occur and must be included in the analysis. These phe- 
nomena are: 




1 The stream expands from the vena contracta to the full area 
of the annulus with a partial recovery of the kinetic energy as 
pressure. The expansion from the vena contracta begins a finite 
distance downstream from the orifice entrance, and no pressure 
recovery will be obtained in an orifice whose thickness is less than 
this distance. Also the pressure recovery is not instantaneous, but 
occurs over a distance. Hence, an orifice whose thickness is be- 
tween the value at which pressure recovery starts and the value 
at which the recovery is essentially complete will benefit by only 
a fraction of the maximum possible pressure recovery. The na- 
ture of the “efficiency of pressure-recovery’’ function is deter- 
mined from the experimental results. 

2 Wall-friction losses occur during flow through the orifice 
length. The frictional losses are calculated from a friction factor 
versus Reynolds number curve for flow between parallel plates, 
the curve being based on available data (2, 7, 8). This method of 
evaluation of annular-orifice friction losses can be considered only 
approximate due to the disturbed entrance conditions, partial 
inclusion of frictional losses in the analysis of the pressure re- 
covery, and lack of knowledge regarding relative roughness. 
The resulting equation is found to be (1) 

1 1 2 

where F = 0, for Z < 1.15 and F = 1 — e~-%*(2-1-1), for Z > 
1.15. C, and f, are taken from Figs. 5 and 10, respectively. 

For most tests Equation [11] predicted results within 2 per 
cent for Re > 4000. In a few cases, especially for Re < 10,000, 
there were differences up to 4 per cent. Values should be taken 
from Fig. 6 where available, but Equation [11] appears to be en- 
tirely acceptable for extrapolation. 

(F) Turbulent Range, Concentric, Thick, Round-Edge Orifice.
For the case of the round-edge orifice in the regime of fully developed
turbulent flow, Equation [11] can be simplified considerably.
If the disk edge is sufficiently rounded, there will be no
stream contraction; i.e., $C_c$ will equal unity minus any effect of
entrance friction, and there will be no pressure recovery. Assuming
a 1 per cent loss of kinetic energy due to entrance friction,
Equation [11] becomes

$1/C^2 = 1.01 + \ldots$ [12]

The rounded-entrance length is not included in the calculation of
Z.


Fig. 10 Friction Factor for Annuli of Fine Clearance and
for Parallel Plates

On the basis of results from one orifice only, Equation [12] 
agrees with experiment within 2 per cent for Re > 10,000. 
Equation [12] is seriously in error at lower Reynolds numbers, 
probably due to entrance effects which are not included in the 
analysis. For high flow rates and low Z-values, C can be taken as 
unity. At Re > 10,000, Equation [12] may be used to predict the 
effect of thickness. 

(G) Turbulent Range, Tangent, Square-Edge Orifice. A general 
method of evaluating tangent-orifice coefficients is to evaluate 
numerically the equation 


$$C = \frac{\sum (C'\,\Delta S)}{S} \qquad [13]$$


from concentric experimental data. The procedure is 


1 Divide the orifice into an appropriate number of sectors. 

2 From Equations [27], [28], and [29] (see Appendix), determine
values of ΔS, a, and Z' for each sector.

3 Assume a value C’ for the sector. 

4 Compute Re' from the assumed C'.

5 Check the assumed value of C’ against the experimental 
value from Fig. 8. 

6 Assume a new C’ and repeat steps 4 and 5 until agreement is 
attained. 

7 Repeat for each sector and substitute the results into Equa- 
tion [13]. 
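
The seven steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' computation: `lookup` is a hypothetical stand-in for reading C' from Fig. 8 at given Re' and Z'; the point Reynolds number is taken as Re' = 2aC'√(2ρΔP)/μ in consistent units, which is the form implied by the Appendix derivation; and a simple fixed-point iteration replaces the manual "assume a new C' and repeat" of steps 3 to 6.

```python
import math
from typing import Callable


def tangent_orifice_coefficient(
    big_r: float,      # shell radius R
    small_r: float,    # disk radius r
    length: float,     # orifice length L
    rho: float,        # fluid density
    mu: float,         # fluid viscosity
    delta_p: float,    # pressure drop across the orifice
    lookup: Callable[[float, float], float],  # hypothetical C'(Re', Z')
    sectors: int = 360,
    tol: float = 1e-10,
    max_iter: int = 200,
) -> float:
    """Evaluate C = sum(C' * dS) / S over sectors, as in Equation [13]."""
    q = math.sqrt(2.0 * rho * delta_p)  # assumed mass-velocity scale
    gap = big_r - small_r
    d_theta = 2.0 * math.pi / sectors   # step 1: divide into sectors
    total_area = 0.0
    weighted = 0.0
    for i in range(sectors):
        theta = (i + 0.5) * d_theta     # sector mid-angle, never exactly pi
        c = math.cos(theta)
        # Step 2: sector area, clearance and point Z' (Equations [27]-[29]).
        d_s = (gap**2 * c**2 + big_r * gap * c + small_r * gap) * d_theta
        a = gap * (c + 1.0)
        z_point = length / a
        c_point = 0.6                   # step 3: initial guess for C'
        for _ in range(max_iter):       # steps 4 to 6
            re_point = 2.0 * a * c_point * q / mu
            c_new = lookup(re_point, z_point)
            converged = abs(c_new - c_point) <= tol
            c_point = c_new
            if converged:
                break
        total_area += d_s               # step 7: accumulate for Equation [13]
        weighted += c_point * d_s
    return weighted / total_area
```

If `lookup` is replaced by the viscous law C' = √(Re'/64) with L = 0, the summation should reproduce the closed-form result of the Appendix, which gives a useful check before real chart data are substituted.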


Equation [13] has been evaluated for several cases, most of 
them for an orifice of Z = 2.40 where the variation of C with Re 
is somewhat erratic. In general, the results are within 3 per cent 
of experiment, and there is evidence that agreement within 2 per 
cent is obtained for orifices of greater Z. However, experimental 
results should be used when available. 

(H) Turbulent Range, Tangent, Thick, Square-Edge Orifice,
Z > 9. The integration of Equation [11] in the manner shown for
Equation [6] in the Appendix for the viscous regime is attended
with both theoretical and mathematical difficulties. Equation
[11] applies only to fully developed turbulent flow, but in the
orifice region near the point of tangency both viscous and
transition regions occur. Mathematically, it is necessary to restrict
the treatment to a thick orifice so that pressure recovery is
effectively complete at all points around the perimeter. It is also
necessary to choose mean values of f, and C,, evaluated at the 
over-all orifice Reynolds number. Within these restrictions, the 
equation for a tangent-orifice coefficient for Z > 9 and for Re > 
10,000 is 


Obviously this equation is too restricted in its assumptions and 
too complicated to be of general use. Based on very limited data, 
Equation [14] appears to be applicable to thick orifices at high 
Reynolds numbers, but experimental results should be used where 
available. 

(I) Transition Range, Concentric, Thick, Square-Edge Orifice. A
rigorous analytical treatment of concentric orifices in the transition
regime, taking into account all of the viscous and kinetic
effects, is not possible at present because the exact variation of
each effect with Re and Z is not known. As a first approximation,
a rather rough but basically reasonable analysis has been made by
combining the elements of the treatments developed for the viscous
and turbulent regimes. The resulting equation is


1 
F =OforZ< 1.15, F = 1—e-?-%@-1.18 for Z > 1.15 


where C, and f, are taken from Figs. 5 and 10 at the appropriate 
Reynolds number. 

Equation [15] predicts coefficients within 4 per cent over most 
of the range studied. The greatest errors (up to 12 per cent) 
occur for orifices having Z < 2.5, when 50 < Re < 400. These 
errors are believed to be due to pressure recovery occurring closer 
to the orifice entrance than predicted by the F given in Equation 
[15]. In the transition range, experimental values should be used 
whenever possible, and Equation [15] used for extrapolation to 
higher values of Z. 

(J) Transition Range, Concentric, Thick, Round-Edge Orifice. 
An approximate analysis for this case (1) gives 


[16] 


64 
Re + K + 2f,Z 


where K and f, are taken from Figs. 9 and 10, respectively. 

Based on results from just one orifice, Equation [16] predicts a 
coefficient too low by 3 to 5 per cent. Equation [16] is probably 
satisfactory for predicting the effect of thickness and the predic- 
tion of coefficients for orifices of large Z. 

(K) Transition Range, Tangent, Square-Edge Orifice. The 
method developed for Case G is applicable to this case, but ex- 
perimental results should be used when available. 


DISCUSSION OF RESULTS

The data from all twenty-one orifices were first plotted in the 
manner shown in Figs. 5 and 6. These results were then sum- 
marized in the generalized plots shown in Figs. 7 and 8. 

Sullivan (5) has published orifice-coefficient data for the case of 
a tube passing through a hole of slightly greater diameter in a thin 
plate, the inverse of the present case. His results agree within
±5 per cent with those reported here. Sullivan also established that
coefficients for the case of flow approaching normal to the orifice,
i.e., at 90 deg to the axis of the orifice, are applicable to calculation
of baffle leakage in baffled heat exchangers where there are
multiple orifices and the flow approaches the leakage areas from 
various angles. Therefore, the present results are also usable for 
the leakage around tubes passing through baffles. 

Howell (6) studied a sharp-edged disk in a cylindrical shell 
with disk-to-shell-diameter ratios from 0.7 to 0.9. Extrapolation 
of his data to the diameter ratios studied here indicates good 
agreement with the present results. 

In Figs. 7 and 8 the effects of pressure recovery and wall fric- 
tion are clearly shown. For high Reynolds numbers the pressure 
recovery begins at a Z-value of about 1.0 and causes an increase in 
the flow rate and thus in the coefficient up to a Z-value of about 
6.0. For longer orifice channels the wall friction causes a de- 
crease in the flow rate until, at channel lengths giving Z-numbers 
in the range of 10 to 100, the coefficient again drops to about 0.65, 
the value for the sharp-edged orifice. At higher Z-values the fric- 
tional resistance lowers the value of the coefficient still more. 

There is no pressure recovery for Reynolds numbers below 100 
and friction causes a continual decrease in the coefficient as the 
orifice channel lengthens. 

By the use of Figs. 7 and 8 the flow through annular openings
can be predicted for a wide range of flow rates and orifice dimensions.
Also, to a limited extent, these curves can serve as a design
guide to enhance the flow, as might be desired for orifice-type
baffled heat exchangers or for coolant around a shaft, or to
minimize the flow as would be desired between the baffle and the
shell of a segmentally baffled heat exchanger. The magnitude of
leakage flows through the clearances generally found in commer- 
cial heat exchangers is surprisingly large, and often amounts to 
over half of the flow through the exchanger. With a better 
knowledge of the nature of these flows more accurate predictions 
of performance can be made and a more rational approach to de- 
sign is possible. 


ACKNOWLEDGMENT 

The author acknowledges the valuable suggestions on the program
of the study made by the Special Advisory Committee of the
Heat Transfer Division of The American Society of Mechanical
Engineers. During the course of this work, the committee
included Chairman A. C. Mueller, C. H. Brooks, A. P. Colburn, 
S. Kopp, W. H. McAdams, B. E. Short, W. H. Thompson, T. 
Tinker, and P. R. Trumpler. Special thanks are due to A. 
Wurster for his advice on the equipment design and his com- 
ments on the design methods. The assistance of Research Fellow 
M. D. Leighton, who took the photographs used in this work and 
who aided in numerous other tasks, is greatly appreciated. Funds 
and equipment were furnished by Tubular Exchanger Manufac- 
turer’s Association, The American Petroleum Institute, Andale 
Company, and E. I. du Pont de Nemours and Company. The 
oils were furnished by the Gulf Oil Company. The National 
Science Foundation provided a fellowship under which the re- 
search was done. 


BIBLIOGRAPHY 


1 “Annular Orifice Coefficients With Application to Heat Exchanger
Design,” by K. J. Bell, PhD thesis, Department of Chemical
Engineering, University of Delaware, Newark, Del., 1955.

2 “Fluid Dynamics and Heat Transfer,” by J. G. Knudsen and
D. L. Katz, Engineering Research Bulletin No. 37, Engineering
Research Institute, University of Michigan, Ann Arbor, Mich., 1953.

3 “Laminare Kanaleinlaufströmung,” by H. Schlichting, Zeitschrift
für angewandte Mathematik und Mechanik, vol. 14, 1934, pp.
368-373.

4 “Viscous Flow Through Pipes With Cores,” by N. A. V. Piercy,
M. S. Hooper, and H. F. Winny, Philosophical Magazine, vol. 15,
1933, pp. 647-676.

5 “Heat Transfer and Fluid Friction in a Shell and Tube Exchanger
with a Single Baffle,” by F. W. Sullivan and O. P. Bergelin,
presented at the joint AIChE-ASME Symposium on Heat Transfer at
the 1955 Spring Meeting of The American Institute of Chemical
Engineers.

6 “A Note on R. A. E. Annular Airflow Orifice,” by A. R. Howell,
Aeronautical Research Committee Reports and Memoranda No. 1934,
British, 1939.

7 “An Experimental Study of the Flow of Water in Pipes of
Rectangular Section,” by S. J. Davies and C. N. White, Proceedings
of the Royal Society of London, England, series A, vol. 119, 1928,
pp. 92-107.

8 “Fluid Friction in Annuli of Small Clearance,” by R. F. Nootbaar
and R. C. Kintner, Proceedings of the Second Midwestern Conference
on Fluid Mechanics, Ohio State University Engineering Experiment
Station Bulletin No. 149, 1952, pp. 185-199.


Appendix 


Derivation of Equation for a Tangent Sharp-Edge Orifice in 
Viscous-Flow Regime With No Kinetic Effects. The first step in the 
derivation is to establish the equations of the geometric variables. 
As shown in Fig. 11, the equations of the two circles are

$$r^2 = x^2 + y^2 \qquad [17a]$$

$$R^2 = [x - (R - r)]^2 + y^2 \qquad [17]$$

Transforming to polar co-ordinates gives

$$r^2 = s_1^2 \qquad [18a]$$

$$R^2 = [s_2\cos\theta - (R - r)]^2 + s_2^2\sin^2\theta \qquad [18]$$

where $s_1$ and $s_2$ are the distances from the origin to points on the
inner and outer circles, respectively, at a given value of $\theta$.


Fig. 11


Expanding, simplifying, and solving Equation [18] for $s_2$
gives

$$s_2 = (R - r)\cos\theta + \sqrt{(R - r)^2\cos^2\theta + 2Rr - r^2} \qquad [19]$$

The incremental area of the orifice in any sector is

$$dS = (\pi s_2^2 - \pi s_1^2)\,\frac{d\theta}{2\pi} \qquad [20]$$

Substituting Equations [18a] and [19] into [20] gives

$$dS = \{(R - r)^2\cos^2\theta + (R - r)\sqrt{(R - r)^2\cos^2\theta + 2Rr - r^2}\,\cos\theta + Rr - r^2\}\,d\theta \qquad [21]$$

The clearance between shell and disk, a, at angle $\theta$ is

$$a = s_2 - s_1 \qquad [22]$$

Or using Equations [18a] and [19]

$$a = (R - r)\cos\theta + \sqrt{(R - r)^2\cos^2\theta + 2Rr - r^2} - r \qquad [23]$$

The point value of Z at angle $\theta$ is given by

$$Z' = L/a \qquad [24]$$

(Z', Re', and C' will be used to denote point values of Z, Re, and C
in a tangent orifice.) Combining Equations [23] and [24] gives

$$Z' = \frac{L}{(R - r)\cos\theta + \sqrt{(R - r)^2\cos^2\theta + 2Rr - r^2} - r} \qquad [25]$$

A simplification in the equations may be made for the case
where r/R > 0.8 by the following approximation

$$\sqrt{(R - r)^2\cos^2\theta + 2Rr - r^2} \cong R \qquad [26]$$

Equation [21] then becomes

$$dS = [(R - r)^2\cos^2\theta + R(R - r)\cos\theta + Rr - r^2]\,d\theta \qquad [27]$$

Equation [23] becomes

$$a = (R - r)(\cos\theta + 1) \qquad [28]$$

And Equation [25] becomes

$$Z' = \frac{L}{(R - r)(\cos\theta + 1)} \qquad [29]$$
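
To give a feel for how much the simplification for r/R > 0.8 costs, the Python sketch below compares the exact clearance, a = s₂ − r with s₂ from Equation [19], against the approximate clearance a = (R − r)(cos θ + 1) of Equation [28]. The function names and the sampling density are illustrative assumptions; the radii used in the example are half of the shell and largest-clearance disk diameters quoted in the Apparatus section.

```python
import math


def outer_radius_vector(big_r: float, small_r: float, theta: float) -> float:
    """Exact distance s2 from the disk centre to the shell (Equation [19])."""
    gap = big_r - small_r
    root = math.sqrt(gap**2 * math.cos(theta) ** 2
                     + 2.0 * big_r * small_r - small_r**2)
    return gap * math.cos(theta) + root


def clearance_exact(big_r: float, small_r: float, theta: float) -> float:
    """Exact clearance a = s2 - r (Equation [23])."""
    return outer_radius_vector(big_r, small_r, theta) - small_r


def clearance_approx(big_r: float, small_r: float, theta: float) -> float:
    """Approximate clearance a = (R - r)(cos(theta) + 1) (Equation [28])."""
    return (big_r - small_r) * (math.cos(theta) + 1.0)


def max_clearance_error(big_r: float, small_r: float, samples: int = 3600) -> float:
    """Largest |approx - exact| over 0..pi, as a fraction of (R - r)."""
    gap = big_r - small_r
    worst = 0.0
    for i in range(samples + 1):
        theta = math.pi * i / samples
        diff = abs(clearance_approx(big_r, small_r, theta)
                   - clearance_exact(big_r, small_r, theta))
        worst = max(worst, diff / gap)
    return worst


if __name__ == "__main__":
    # Shell 5.2541 in. diameter, disk 5.0245 in. diameter (radii in inches).
    print(max_clearance_error(5.2541 / 2.0, 5.0245 / 2.0))
```

The discrepancy is largest near θ = π/2 and scales roughly as (R − r)/2R, so it shrinks quickly as the disk diameter approaches the shell diameter.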


In order to express the orifice coefficient in terms of the geometric
variables, the incremental flow rate dW through area dS
is expressed in the following form

$$dW = C'\,dS\,\sqrt{2 g_c \rho\,\Delta P} \qquad [30]$$

where C' and dS are functions of $\theta$.

For the case of the concentric sharp-edge orifice (Z = 0), it
was found that, neglecting kinetic effects

$$1/C^2 = 64/Re$$

The point value of Re is

$$Re' = \frac{2a}{\mu}\,\frac{dW}{dS} \qquad [31]$$

Introducing Equations [28] and [30] into [31] gives

$$Re' = \frac{2(R - r)(\cos\theta + 1)\,C'\sqrt{2 g_c \rho\,\Delta P}}{\mu} \qquad [32]$$

Introducing Equation [32] into [4], noting C now equals C', gives

$$C' = \frac{2(R - r)(\cos\theta + 1)\sqrt{2 g_c \rho\,\Delta P}}{64\mu} \qquad [33]$$

Substituting Equations [27] and [33] into [30] gives

$$dW = \frac{4(R - r)\,g_c \rho\,\Delta P\,(\cos\theta + 1)}{64\mu}\,[(R - r)^2\cos^2\theta + R(R - r)\cos\theta + Rr - r^2]\,d\theta \qquad [34]$$


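Equation [34] is integrated from $\theta = 0$ to $\theta = \pi$ and the integral
multiplied by 2 to allow for symmetry. Simplifying the integral
yields

$$W = \frac{8 \rho g_c\,\Delta P\,(R - r)^2}{64\mu}\int_0^{\pi}(1 + \cos\theta)^2[(R - r)\cos\theta + r]\,d\theta \qquad [35]$$

Expanding the terms in the integral and integrating yields

$$W = \frac{8 \rho g_c\,\Delta P\,(R - r)^2}{64\mu}\left[r\theta + (R + r)\sin\theta + \frac{1}{2}(2R - r)(\sin\theta\cos\theta + \theta) + \frac{1}{3}(R - r)\sin\theta(\cos^2\theta + 2)\right]_0^{\pi} \qquad [36]$$

Introducing limits into Equation [36] gives

$$W = \frac{8 \rho g_c\,\Delta P\,(R - r)^2}{64\mu}\cdot\frac{\pi}{2}(2R + r) \qquad [37]$$

Squaring Equation [1] and combining with Equation [37] gives

$$C^2 = \frac{\pi W (R - r)^2 (2R + r)}{32\mu S^2} \qquad [38]$$

where C is now the over-all coefficient for the tangent orifice.
Introducing Equation [2] and $S = \pi(R^2 - r^2)$ into Equation
[37] gives

$$C^2 = \frac{Re\,(2R + r)}{64(R + r)} \qquad [39]$$

or, since R and r are very nearly the same for the orifices studied

$$C^2 = 3Re/128 \qquad [40]$$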
Equation [7] is Equation [40] with an empirical kinetic correction 
term. 
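
The integration step between Equations [35] and [37] can be checked numerically. The Python sketch below integrates (1 + cos θ)²[(R − r) cos θ + r] from 0 to π with a composite Simpson rule and compares it with (π/2)(2R + r), the value obtained here by inserting the limits into the antiderivative. The helper names and the sample radii are illustrative assumptions, not values taken from the paper.

```python
import math


def integrand(theta: float, big_r: float, small_r: float) -> float:
    """Integrand of Equation [35]: (1 + cos)^2 * [(R - r) cos + r]."""
    c = math.cos(theta)
    return (1.0 + c) ** 2 * ((big_r - small_r) * c + small_r)


def simpson(func, lower: float, upper: float, intervals: int = 2000) -> float:
    """Composite Simpson rule; `intervals` must be even."""
    if intervals % 2:
        raise ValueError("intervals must be even")
    h = (upper - lower) / intervals
    total = func(lower) + func(upper)
    for i in range(1, intervals):
        weight = 4.0 if i % 2 else 2.0
        total += weight * func(lower + i * h)
    return total * h / 3.0


def closed_form(big_r: float, small_r: float) -> float:
    """Value of the definite integral: (pi / 2) * (2R + r)."""
    return 0.5 * math.pi * (2.0 * big_r + small_r)


if __name__ == "__main__":
    big_r, small_r = 2.627, 2.512  # illustrative radii, inches
    numeric = simpson(lambda t: integrand(t, big_r, small_r), 0.0, math.pi)
    print(numeric, closed_form(big_r, small_r))
```

With R ≈ r the closed form tends to 3πr/2, which is where the factor 3/128 in Equation [40] comes from.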
Apparatus. The cylindrical shell, shown in Fig. 1, was of cast
bronze and was bored to a nominal internal diameter of 5.250 in.
The diameter at the test section was 5.2541 in. Pressure taps
were drilled through the cast pad on the side of the shell at the
points indicated in Fig. 1. These points were */, to %/,. in. up- 
stream, depending on disk thickness, and 9 in. downstream. The 
pressure-tap holes in the inside-shell wall were 1/16 in. diam. The
approximate position of the orifice plates is shown. 

The disks were machined from metal plates of the desired thick- 
ness. The disk diameters ranged from 5.0245 to 5.2325 in., giving 
diametral clearances from 0.2296 to 0.0216 in. The disks had 
square edges and ranged in thickness from 0.0135 to 0.8934 in. 
As shown in Fig. 2, the disks were held in place by means of four 
*/,-in-OD steel support rods. These rods were screwed into a 1- 
in-thick steel end plate which in turn was held by the studs on the 
downstream end of the shell. When tests were to be made with 
concentric mounting, i.e., uniform clearance between disk and 
shell around the disk periphery, three support tabs were soldered 
to the downstream face of the disk and extended over the edge to 
touch the shell. When the disks were to be tested with the edge of 
the disk tangent to the shell, a small shim was inserted at one 
point between the shell and end plate to force the disk against the 
shell. Fig. 3 shows the shell and a disk in place with connecting 
piping removed, and Fig. 4 shows the apparatus ready for 
operation. 

A conical header was used to bring the flow from the 2-in. pipe 
to the shell. A 40-mesh screen was inserted between the header 
and the shell to give a reasonably flat velocity profile. 

The oil-pumping system was constructed of 2-in. steel pipe. 
The oil flowed from a large storage tank to one of two rotary posi- 
tive-displacement pumps in parallel. The flow rate through the 
orifice was controlled by a by-pass line immediately after the 
pump. The orifice-flow rate was metered by a bank of four 
calibrated rotameters in parallel, each followed by a control valve. 
Each rotameter was calibrated over the entire range of flow by 
using a weigh tank. Calibrations were checked periodically. 
The metering range available was 200 to 40,000 lb per hr. From 
the control valve, the oil passed through the header and the 
orifice and returned to the storage tank. Auxiliary heating and 
cooling of the oil was done in an independent parallel system con- 
nected to the storage tank. This system consisted of a centrifugal 
pump and a heat exchanger with steam and cold-water connec- 
tions. 

The water-pumping system was constructed of 1 1/2-in. brass
pipe with bronze fittings. The water flowed from an overhead 
surge tank, which served both to maintain a constant head and to 
remove entrained air, to a centrifugal pump. After the pump, the 
stream passed through a calibrated 10 to 100-gpm guided-float 
rotameter and control valve and into the header and orifice. 
From the orifice, the flow passed through a double-pipe heat ex- 
changer and into the surge tank. 

The pressure difference across the orifice was measured on dif- 
ferential manometers using either test fluid (oil or water) over 
mercury, air over test fluid in a vertically mounted manometer, or 
air over test fluid in a slanted manometer with a slope of 0.275. 
The connecting tubing was kept horizontal in regions where there 
might be fluid-density differences in the line. 

Point temperatures were measured by single copper-constantan 
thermocouples inserted into the stream. The temperatures were 
indicated on a self-balancing electronic potentiometer. The 
measured temperatures are believed to be within ±0.3 deg F of
the true temperatures. 

The test fluids were water, Gulf 896 oil (viscosity 1.86 cp at
150 F), and modified Gulf Crown C oil (viscosity 63.0 cp at 150
F).

Experimental Procedure. For the oil runs, the oil was brought 
up to the desired operating temperature by circulation through 
the auxiliary system. The main oil pump was then started with 
the by-pass full open and the system vented under pressure to
remove trapped air. The valves were adjusted to give an orifice-
flow rate near the desired maximum and the manometers drained 
until there was no further evidence of air in the lines. The drain 
lines were closed, the flow rate adjusted to the final value, and the 
system allowed to run to equilibrium. For each run, the data 
taken were: Rotameter number and reading, tank temperature, 
inlet fluid temperature, and manometer type and reading. 
Manometer readings were taken at intervals of 3 to 5 min, and
three essentially identical readings were considered necessary to 
indicate attainment of equilibrium. When the run was com- 
pleted, the flow rate was adjusted for the next run and the dis- 
charge valve adjusted to maintain a small positive pressure on the 
shell. 

The minimum acceptable reading on any manometer was a 1- 
in. differential; for smaller pressure differentials, a manometer of 
greater sensitivity was used. At the completion of a series of 
runs, “no-flow readings” of the manometers were taken. If the
manometer levels were within ±0.025 in. of exact balance, the
deviation from the null point was applied to the indicated differential
as a correction for minute quantities of air in the lines or
slight misalignment of the equipment. “No-flow readings”
greater than the 0.05 maximum differential mentioned were considered
to indicate appreciable air in the manometer lines. The
series of runs was rejected and the condition corrected. 

For low flow rates of Gulf Crown C, the rotameters were very 
sensitive to small changes in temperature, and it was necessary to 
weigh the flow for each run. 

For water runs, the procedure was similar to that for oil runs 
except that cooling was accomplished by direct introduction of 
cold water from the mains and by-passing was not required since a 
centrifugal pump was used. 

Calculations and Presentation of Experimental Results. An over- 
all orifice coefficient was calculated for each experimental run, 
using Equation [1]. An orifice Reynolds number also was cal- 
culated for each run, using Equation [2]. For each orifice, the re- 
sults were plotted as C versus Re. The results for orifices 5.02-S, 
(D — d) = 0.2290 in., L = 0.0135 in.; and 5.20-1, (D—d) = 
0.0537 in., L = 0.8934 in., are shown in Figs. 5 and 6. 
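
As a worked check on the two orifices just quoted, the Python sketch below computes the length-to-width ratio Z from the listed dimensions. Equation [3] is not reproduced in this part of the text, so the definition used here, Z = L divided by the radial clearance (D − d)/2, is an assumption inferred from the value Z = 33.3 quoted earlier for orifice 5.20-1; the function name is illustrative.

```python
def length_to_width_ratio(diametral_clearance_in: float, length_in: float) -> float:
    """Z = L / a, taking the width a as the radial clearance (D - d) / 2.

    The definition is assumed, not quoted from Equation [3].
    """
    if diametral_clearance_in <= 0.0:
        raise ValueError("diametral clearance must be positive")
    radial_clearance = diametral_clearance_in / 2.0
    return length_in / radial_clearance


if __name__ == "__main__":
    # (D - d) and L in inches, as listed for the two typical orifices.
    orifices = {"5.02-S": (0.2290, 0.0135), "5.20-1": (0.0537, 0.8934)}
    for name, (clearance, length) in orifices.items():
        print(f"{name}: Z = {length_to_width_ratio(clearance, length):.3f}")
```

On this definition orifice 5.02-S is very nearly sharp-edged (Z of about 0.12), while orifice 5.20-1 is a long channel.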

The orifice length-to-width ratio Z defined by Equation [3] was 
found to be important. For all of the orifices the experimental 
results are summarized in Figs. 7 and 8, where the coefficient is 
plotted as a function of Z with the orifice Reynolds number as a 
parameter. If more detailed information is desired, the tables of 
experimental data and plots of C versus Re for twenty-one orifices 
may be obtained from the American Documentation Institute.⁴

Experimental Errors. The calculated orifice coefficient may be 
in error as a result of errors in the measured value of the diame- 
ters, the flow rate, and the manometer reading. The Reynolds 
number may be in error as a result of errors in the flow rate, the 
fluid temperature, and the measurement of the fluid viscosity. 
The maximum possible error in the coefficient was calculated to 
be in the range from 1.8 to 5.2 per cent, and that for the Reynolds 
number from 0.9 to 2.1 per cent. It is believed that these maxi- 
mum errors were never obtained and that the orifice coefficients 
are within ±2 per cent of the true value and the Reynolds numbers
are within ±1 per cent.

For concentric orifices, there is also a possible error due to im- 
perfect centering of the disk in the shell. This error, which can be 
quite significant in the viscous-flow regime, was discussed earlier 
in the comparison of theoretical and experimental results. 


4 The tables and plots have been deposited as Document No. 4993 
with the ADI Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington 25, D.C. A copy may be
secured by citing the Document number and by remitting $6.25 for 
photoprints, or $2.50 for 35 mm microfilm. Advance payment is 
required. Make checks or money orders payable to: Chief, Photo- 
duplication Service, Library of Congress. 




How RF Concerns the Wood Industry 


By J. W. MANN,¹


High-frequency alternating currents to the order of 
several millions of cycles per second applied to dielectric 
materials are referred to as “RF heating,” and are used 
widely in wood lamination as a means of setting synthetic 
adhesives in such members. Aspects of federal rules and 
regulations and the needs of industry are set forth. Means 
of accomplishing industrial applications; methods of 
estimating time cycles; the unique properties of the high- 
frequency field of force such as selectivity to conductive 
paths and nonuniformity of the heat placement in dielec- 
trics are covered in some detail with specific examples. 


INTRODUCTION 


THE wood industries are constantly encountering a problem
of procuring clear lumber for manufacturing into end products.
Decreasingly available supplies of old-growth timber
have forced users to employ timber of poorer grades and better,
and more fully, to utilize those high grades which are still available
to them.

This is illustrated well by the search of the plywood industry 
for better and more economical methods of utilizing lower-grade 
peeler logs—to edge-glue veneer from which defects have been 
clipped, and to patch finished panels showing defects. As logs 
become scarcer, this search will be intensified. The growth of 
dielectric high-frequency heating in the wood industries is a 
direct result of this search for better methods of conservation and 
utilization of the raw materials currently available. This 
intensification has been substantial since the end of World War 
II, and continues apace today without prospect of let up. It is 
upon the economic premise that the equipment produced by elec- 
tronics manufacturers substantially assists industry in its 
search for better utilization of its available raw materials and 
assists industry in employing materials not otherwise usable, that 
the dielectric high-frequency heating industry has grown and 
prospered. 

Among wood end products produced with dielectric high-fre- 
quency heating equipment are the following: Door stiles and 
rails; piano sounding boards; spruce piano components; chair 
legs, seats and arms; drawer runners, faces and sides; table and 
desk tops, skirts and siding; stadium seats, barn and church 
rafters; structural beams, stringers and boat keels; oak mine- 
sweeper parts, planking, and hulls; boat parts; finger-joined end- 
glued lumber; shingle panels; siding; pipe and tank staves; 
tent poles; radio masts; hardwood-veneer sheets; Douglas fir, 
endless, edge-glued veneer; TV and radio cabinets; shelving and 
millwork parts; store display cabinets; edge-glued boards and 
panels; furniture core stock and numerous others. 

Types of equipment produced for separate uses are so numerous
that unnecessary length would be added to this brief if all such
different applications were described in detail. Prior to 1950
batch-production electrodes and clamping devices dominated the
means of applying the high-frequency field of force to industrial
uses, while the more recent trend in large equipment is to
continuous-process handling in which especially striking progress has
been made.


1 Vice-President, Mann-Russell Electronics.

Contributed by the Wood Industries Division and presented at the
Spring Meeting, Portland, Ore., March 18-21, 1956, of The American
Society of Mechanical Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
19, 1956. Paper No. 56—S-17.


TACOMA, WASH.

The abbreviation RF concerns the whole subject of heating
dielectrics by means of high-frequency alternating electromagnetic
fields of force, mostly in the range of frequencies above 3 megacycles
(mc).

Many dielectrics or semidielectrics, besides wood and certain 
types of glues, are being heat-treated, dehydrated, preheated 
internally, and fabricated by the application of RF. 

In the present state of the art, high-frequency alternating current,
of the order of megacycles per second, is obtained most conveniently
from conversion of conventional 60-cycle-per-sec (cps)
power sources through the medium of three-element vacuum- 
tube oscillators. Incidentally, the capital investment per kilo- 
watt of RF output power for high-frequency generators, 5 kw 
and above, ranges around $600 per kw. 

The electron swing of an alternating current is quite analogous 
to the motion of a swinging pendulum. Frequency is a measure 
of the number of complete cycles occurring per second. 

Theoretically speaking, high-frequency alternating currents 
are only different from slow-frequency alternating currents with 
respect to the degree to which any one phenomenon manifests 
itself. Sixty-cycle alternating electromagnetic fields show 
negligible heating effects in dielectrics as compared to those in the 
millions of cycles-per-second range. In many cases, other factors 
remaining unchanged, the heating effect is proportional to the 
frequency. Therefore high frequencies seem desirable for dielec- 
tric heating and processing purposes. 


FREQUENCY; WAVE LENGTH; RESONANCE


Alternating electromagnetic fields of force are propagated 
through space at the speed of light; thus one complete cycle may 
be expressed in wave length, the wave length being inversely pro- 
portional to the frequency. 

At a frequency of 60 cps the wave length is of the order of 3100
miles. Experience indicates frequencies of 3 mc per sec and above
to be most adaptable for dielectric heating. At 3 mc the wave
length in space is approximately equal to 100 m or 328 ft.
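
The figures just quoted follow from dividing the speed of light by the frequency. A minimal sketch of that arithmetic is given below; the function name and the unit-conversion constants are introduced here for illustration only and form no part of the original discussion.

```python
# Free-space wave length from frequency: wave length = c / f.
# Names and constants below are illustrative assumptions.

SPEED_OF_LIGHT_M_PER_S = 299_792_458.0
METERS_PER_FOOT = 0.3048
METERS_PER_MILE = 1609.344


def wavelength_m(frequency_cps: float) -> float:
    """Return the free-space wave length in meters for a frequency in cycles per second."""
    if frequency_cps <= 0:
        raise ValueError("frequency must be positive")
    return SPEED_OF_LIGHT_M_PER_S / frequency_cps


power_line = wavelength_m(60.0)      # conventional 60-cps supply
rf_heating = wavelength_m(3.0e6)     # 3 mc, lower end of the dielectric-heating range

print(f"60 cps: about {power_line / METERS_PER_MILE:.0f} miles")
print(f"3 mc:   about {rf_heating:.1f} m, or {rf_heating / METERS_PER_FOOT:.0f} ft")
```

The inverse proportionality is the point of interest: doubling the frequency halves the wave length, and with it the resonant line lengths discussed in what follows.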

Wave length involves load dimensions, which in turn involve
frequency response, which again requires that “line length” be
defined. The communications radio engineer is primarily concerned
with radiating a maximum of electromagnetic energy into
space by means of an antenna or line at the expense of the 
source, whereas the engineer concerned with RF industrial heat- 
ing obviously bends every effort to minimize radiation into space 
and, instead, to convert a maximum of electrical energy into heat 
energy within the bounds of the dielectric being processed, the 
latter included as a part of a line length of the applicator. 
The communications engineer deals with loads in the form of 
antennas which are stable and simple with respect to geometrical 
shapes as compared to the problems met with in RF dielectric 
heating. In the latter, wood and plastics are especially variable, 
changing electrical constants with every second of time during 
the heating cycle. 

The geometrical shapes of clamping means in three dimensions 
can be infinite in variety and the dielectric may be held stationary 
in the field or continuously moved through the system as may be 
desired. 




The communications engineer normally cuts his radiating 
antennas to correspond closely to quarter wave lengths or mul- 
tiples of quarter wave lengths corresponding to the frequency 
employed. Such a resonant length has been designated as the 
“electrical length’’ of the system regardless of geometrical shape, 
meaning that its particular fundamental frequency response is 
identical with the corresponding calculated space wave length. 
Dielectric loads are also tailored to conform to some desired elec- 
trical length. 

It is significant to note that high-frequency electromagnetic 
energy is generally transmitted or received by exciting the source 
and the sink into a state of oscillation involving reflection of 
energy which results in so-called “standing waves.’’ This con- 
dition in turn requires that the line lengths correspond to the 
calculated space wave lengths. Thus there is close correlation 
in RF heating between the geometry of a dielectric load and its 
inherent electrical properties, such as dielectric constant and self- 
inductance thereof, together with the inductance and capacity of 
the associated connections, electrode applicators, supporting 
framework, shielding, and so on. 

Just as in communications, the source of the electromagnetic 
energy may be transmitted to the sink through the medium of a 
nonresonant, nonradiating, traveling-wave transmission line. 
However, owing to the changing characteristics of a dielectric 
load during the heating cycle, such a linkage is highly unsatis- 
factory in many types of application to dielectrics. 

A superior means of furnishing energy from an electronic gen- 
erator to an electrode configuration which applies the field of 
force to the dielectric constitutes, in effect, making the load con- 
figuration part of the oscillator itself and eliminating any link 
between the two, such as is described as a nonresonant traveling- 
wave transmission line. 

Generators of the type which encompass the work load as a 
part of the oscillatory circuit rather than those connected to the 
load by output coupling means are usually self-excited generators. 
They have no frequency-control circuit such as a crystal oscillator 
to maintain fixed frequency. Self-excited generators have been 
found adaptable to work with wood whose dielectric swing takes 
place on heating of the load. During heating, therefore, the LC
ratio of the oscillatory circuitry shifts, causing a shift in oscillating
frequency: at the 13.56-mc frequency as much as plus and
minus 250 kilocycles (kc) from center frequency, at the 27.12-mc
frequency up to 500 kc, and at the 40.68-mc frequency up to 700
kc; but many variations are less, depending on electrode configuration,
load, and so on.
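
The size of these swings may be related to the change in load capacity through the familiar resonance relation f = 1/(2π√(LC)). The sketch below assumes, purely for illustration, that the inductance of the circuit stays fixed and that only its capacity changes during heating; the function names are hypothetical and the simplification is not claimed to represent any particular generator.

```python
import math


def resonant_frequency_cps(inductance_h: float, capacitance_f: float) -> float:
    """Resonant frequency of an ideal LC circuit, f = 1 / (2*pi*sqrt(L*C))."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance_h * capacitance_f))


def capacitance_change_for_shift(center_cps: float, shift_cps: float) -> float:
    """Fractional change in C that moves resonance from center to center + shift.

    Assumes L is constant, so C is proportional to 1 / f**2.
    """
    return (center_cps / (center_cps + shift_cps)) ** 2 - 1.0


# Center frequencies (mc) and maximum swings (kc) as quoted in the text.
for center_mc, swing_kc in ((13.56, 250.0), (27.12, 500.0), (40.68, 700.0)):
    center = center_mc * 1.0e6
    swing = swing_kc * 1.0e3
    rise = capacitance_change_for_shift(center, +swing)
    fall = capacitance_change_for_shift(center, -swing)
    print(f"{center_mc} mc: +{swing_kc:.0f} kc needs {rise:+.1%} in C, "
          f"-{swing_kc:.0f} kc needs {fall:+.1%} in C")
```

Under this assumption a swing of a few hundred kilocycles corresponds to a change of only a few per cent in circuit capacity, which suggests why a load whose constants change during the heating cycle is hard to hold at a fixed frequency.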

Configurations and types of dielectric loads which have rela- 
tively slow natural resonant periods are best suited to being ex- 
cited by means of correspondingly long wave length (slow fre- 
quency) sources of RF energy. 

Following are some practical examples of matching load dimensions to
frequency. In loads of large dimension (e.g., 10-ft-long veneer
for edge-gluing), a frequency of 13.56 mc is most readily employed;
in loads of medium size (e.g., 4-ft-long glued shingle
panels), a frequency of 27.12 mc is a natural; while on smaller
loads (e.g., plastic sealing and butt finger joints), a 40.68-mc
frequency makes the best adaptation in circuitry which permits
the load to function as an integral part of the generator oscillatory
circuit.


RADIATION AND REGULATION 


It is difficult to eliminate radiation of RF energy into space
from any high-frequency source such as an RF industrial installa- 
tion and hence the Federal Communications Commission has be- 
come interested in interference to communications which might 
result from improper installations. 




In 1945 rules were promulgated (Rule 18) and space was al- 
lotted in the spectrum for industrial applications of RF and, for- 
tunately, whether by design or by expediency, frequencies were 
allotted which later proved to be of wave lengths quite suitable 
to matching the dimension of many industrial dielectric applica- 
tions; three specific examples of this were just pointed out. 

Little was understood at the time concerning RF applications 
to industry and, as a result, only these few and meager band 
widths were allotted in an overcrowded spectrum already prac- 
tically monopolized by communications. 

These three frequencies have plus and minus tolerances which
are inadequate if industrial, scientific, and medical equipment is to
confine its operation to “on-band” use. Restrictions for “off-band”
use are so tight that many applications which might be
used cannot currently be considered. To be practical and of use
to industry, the present bands must be widened for on-band operation,
with harmonic and near-field protection provided for the
frequencies employed for dielectric high-frequency heating applications.

It will be noted that no frequency is provided by Rule 18 for
on-band operation in the range approximating one half of the
13.56-mc band. As pointed out, the load size is one factor which
determines proper operating frequency, and the time cycle required
is a second and, of course, the most governing economic factor.
It would appear impractical to make a 16-ft lumber edge gluer
operating on 13.56 mc and to develop at such frequency the required
power. If such a frequency were available it undoubtedly
would be employed. The swing of a self-excited oscillator operating
on a large wood load at such frequency might be confined to
less than 100 kc each side of center frequency.

Use of radio frequency by industry is rapidly becoming indis- 
pensable because of two unique properties of RF covered in 
further detail in the following: (1) Quality of selectivity, and (2) 
internal heat placement of the field. Most, if not all, dielectrics
class not only as electrical insulating media but also
as very poor conductors of heat. Consequently, conventional
methods of imparting heat to the interior portions of dielectrics, 
such as wood or plastics especially, result in time-consuming 
and inefficient procedures. A block of wood may be charred on 
the surface by conduction heat applied externally without ap- 
preciably affecting the temperature farthest from the surfaces 
but the same block of wood exposed to an RF field may be 
charred internally before the surface indicates what is taking 
place within. By applying exterior heat and humidity control 
simultaneously with RF internal heating, the time required for 
kiln-drying lumber may be cut substantially. 


QUALITY OF SELECTIVITY


RF exhibits a unique selective effect which, in the majority of 
applications, is basic to many processes. For instance, a wet 
glue line if positioned in the electric field, substantially parallel 
to the electric lines of force, will absorb energy at a rate equal to 
about ten times the rate at which the adjacent wood absorbs 
energy. The electric flux fringes into the glue line in preference 
to the body of the wood. This is a patented process known as
“parallel bonding,” basic especially to edge-bonded panels,
lumber, veneers, and the like.

In the drying of veneers, high moisture-content pockets may
persist. If such veneers are subjected to the selective effect of
RF, the moisture content (mc) of the panels may be equalized completely.
Incidentally, this selective effect has been found to be of potential
value in the pasteurization of wine and the killing of infestation
in cereals, and so on.

It is now conceded that RF is indispensable where heat must
be generated deep in a dielectric or where certain portions or particles
of a nonhomogeneous substance must be differentiated
by means of the selective effect.

It may be stated here that certain glues, such as animal glue, 
although selective due to mc, are not suitable where RF is con- 
cerned. Certain synthetic glues are highly selective and poly- 
merize rapidly with rise in temperature, resulting in a bond 
which is waterproof or highly so, with properties comparable to 
the strength of the natural wood itself. 


NONUNIFORMITY OF THE FIELD 


Theoretical consideration has established the premise that a 
dielectric placed between the plates of a condenser and subjected 
to RF power will be heated throughout absolutely uniformly. 
This can only be true provided the electromagnetic waves sweep 
one after the other, through the dielectric in the form of a travel- 
ing wave. Thus the effect would be average and result in uni- 
formity. 

Barring the losses of heat resulting from surface radiation, con- 
vection, and conduction, experience strongly indicates a non- 
uniform placement of energy concentrated on either side of the 
center plane located half way between the plates of the condenser 
encompassing the dielectric load. This has been conveniently
designated as the “two-spot” effect and the regions of intensified
heat energy as “energy concentrations.” It is significant to
observe that in a “single-ender” or quarter-wave system where
one condenser plate opposes a grounded plate, only one highly
heated concentration will be observed somewhere between ground
and the live element. On the other hand, in a “double-ender”
system, two concentrations will be observed as just described.

This contradiction of theory requires some deep study. As a
matter of fact, RF functions best where standing waves of energy 
are propagated across the dielectric. Each half cycle of alter- 
nating current, where current and voltage are substantially 90 
deg out of phase, contains positive and negative loops of energy, 
the peaks of which are located at or near one-eighth wave inter- 
vals. Since the wave “stands,’’ as it were, with respect to the 
dielectric, so also must the peak-of-energy-conversion regions 
stand in the dielectric. As a result the center portion of each 
quarter wave will be subjected repeatedly to a pulse of heat en- 
ergy and result in the concentration-of-heating effect. 

Of course the contour of electrodes also may influence the 
fringing of electric flux into the dielectric and result in hot spots 
not related. 

Harmonic content of the fundamental field also may be a con- 
tributor to the nonuniform placement of heat energy in a dielec- 
tric. 

Naturally, nonuniform heating may or may not be desirable in 
certain processes. However, knowing the causes, considerable 
correction may be brought about and the effect employed to ad- 
vantage in dielectric high-frequency heating. 

Because of the peculiar phenomena of energy placement just 
described, the RF engineer is faced with the problem not only of 
staying closely within some assigned frequency and related wave 
length, but also he must establish a standing-wave pattern or 
node which will locate the energy concentration in the body of 
the dielectric so that certain maximum desired effects may ensue 
in the right places. In order to function, the standing-wave 
pattern of nodes and antinodes must bridge all discontinuities 
between generator and load, such as variable condensers and in- 
ductances used for tuning each quarter wave length of the system. 

Assume wood in the form of an edge-glued package and 
clamped edgewise and confined to a flat plane between plates 
which also function as the opposing plates of a condenser. 

The frequency response of such a package is affected and con- 
trolled by the following factors: 




1 Capacity of such condenser varies directly as the area of the 
dielectric covering the condenser plates. 

2 Varies inversely as the thickness of the dielectric. 

3 Varies directly with the several composite dielectric con- 
stants of substances composing the dielectric package; i.e., wood, 
glue, moisture, etc. 
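
The three proportionalities just listed are those of the ideal parallel-plate condenser, C = ε0·εr·A/d. The following sketch assumes a single homogeneous dielectric filling the gap and neglects fringing, which is a simplification of the composite package described above; the dielectric constant of 3 and the dimensions are illustrative values only, not measurements.

```python
# Ideal parallel-plate condenser: C = eps0 * eps_r * A / d.
# A hedged sketch: homogeneous dielectric, no fringing, illustrative numbers.

EPSILON_0_F_PER_M = 8.8541878128e-12  # permittivity of free space


def plate_capacitance_f(area_m2: float, thickness_m: float,
                        relative_permittivity: float) -> float:
    """Capacitance in farads of an ideal parallel-plate condenser."""
    if area_m2 <= 0 or thickness_m <= 0:
        raise ValueError("area and thickness must be positive")
    if relative_permittivity < 1:
        raise ValueError("relative permittivity must be at least 1")
    return EPSILON_0_F_PER_M * relative_permittivity * area_m2 / thickness_m


# Hypothetical package: 1 square meter of plate area, 2 cm thick, eps_r = 3.
base = plate_capacitance_f(1.0, 0.02, 3.0)
print(f"base capacity:      {base * 1e12:.0f} pF")
print(f"double the area:    {plate_capacitance_f(2.0, 0.02, 3.0) * 1e12:.0f} pF")
print(f"double thickness:   {plate_capacitance_f(1.0, 0.04, 3.0) * 1e12:.0f} pF")
print(f"eps_r raised to 6:  {plate_capacitance_f(1.0, 0.02, 6.0) * 1e12:.0f} pF")
```

Since the dielectric constant of the package rises with moisture and changes as the glue sets, the capacity, and with it the frequency response, moves during the heating cycle.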


ESTIMATE OF TIME CYCLES


By experience, the Btu output required to do a certain job
may be closely estimated, based upon the observed mass rise in
temperature of types of woods and glues which produces consistent
bonds. In “parallel bonding” it may be assumed safely
that nearly all the moisture content (mc) of the glue will be evaporated. Hence,
by adding the Btu content of the wood to the Btu stored in the
mc of the wood plus the heat of vaporization required to evaporate
the mc in the glue, one may calculate from these data the approximate
time cycle to be expected from any given-size generator.
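
The heat balance just described may be sketched as follows. The three terms are those named in the text; every numerical input (weights, specific heat of the wood, temperature rise, generator rating) is a hypothetical value chosen for illustration, and the sketch assumes that the whole generator output appears as heat in the load, which is optimistic.

```python
# Time-cycle estimate from a simple heat balance. All inputs are illustrative.

BTU_PER_KWH = 3412.14              # 1 kilowatthour expressed in Btu
WATER_SPECIFIC_HEAT = 1.0          # Btu per lb per deg F
LATENT_HEAT_BTU_PER_LB = 970.0     # approximate heat of vaporization of water


def time_cycle_seconds(wood_lb: float, wood_specific_heat: float,
                       wood_water_lb: float, glue_water_lb: float,
                       temp_rise_f: float, generator_kw: float) -> float:
    """Seconds needed for a generator of the given output to supply the heat."""
    if generator_kw <= 0:
        raise ValueError("generator output must be positive")
    heat_in_wood = wood_lb * wood_specific_heat * temp_rise_f
    heat_in_wood_moisture = wood_water_lb * WATER_SPECIFIC_HEAT * temp_rise_f
    heat_to_evaporate_glue_water = glue_water_lb * LATENT_HEAT_BTU_PER_LB
    total_btu = heat_in_wood + heat_in_wood_moisture + heat_to_evaporate_glue_water
    hours = total_btu / (generator_kw * BTU_PER_KWH)
    return hours * 3600.0


# Hypothetical job: 10 lb of dry wood (0.33 Btu/lb-F), 1 lb of water in the wood,
# 0.2 lb of water in the glue, a 100-F rise, and a 5-kw generator.
cycle = time_cycle_seconds(10.0, 0.33, 1.0, 0.2, 100.0, 5.0)
print(f"estimated time cycle: {cycle:.0f} sec")
```

In practice the estimate would be checked against observed cycles, since losses and the selective heating of the glue line both affect the heat actually required.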


MATCHING LOAD AND GENERATOR


Preferably, the more nearly the load becomes a part of the 
generator standing-wave pattern, rising and falling in unison 
and in the same phase, the more stable the whole system becomes, 
as previously described. 

Some systems use what is known as loose coupling, which 
method pertains almost exclusively in communications. For 
dielectric loads, if an attempt is made to couple too closely for the 
purpose of using the power source to capacity, the load may sud- 
denly rob the oscillator of its stored energy, oscillations cease, and 
the result is a heavy short circuit and power outage. The stand- 
ing-wave pattern of the load is 180 deg out of phase with that of 
the generator. Such a system is quite unstable when loaded with 
widely changing loads as are wood and glue. 

Single-ender systems may be described as those in which the 
load spans a quarter-wave line length of electromagnetic energy 
usually recognizable by the load being interposed between a 
“live” plate and the grounded framework of the press.

In a double-ended system the load is placed between plates of 
opposite instantaneous charge, the press frame being symmetrical 
to, and neutral to, both plates. This system is to be preferred in 
most cases because of more nearly uniform distribution of heat 
and balanced charges. 

The shape and size of electrode configurations must be designed 
to fit the work in hand. They may be shaped to spot set glues, to 
form Z-patterns or a partial set pattern produced by stray field 
and split-pole methods. By these means, with a given power 
source, production rates per kilowatthour may be raised, the 
unset glue being later polymerized, more or less, by the residual 
heat stored in the wood. In most of these cases parallel bonding is 
employed. In the case of scarfed joints, perpendicular bonding 
has been found a preferred method at a sacrifice of time cycle, 
thus producing near full strength immediately and solid-setting 
feather edges of the scarf. 

Wherever possible, continuous-process bonding devices should 
be used because of the higher production rates possible per kilo- 
watt as compared to the stop-and-go system. 


OTHER FACTORS


Since the press framework and auxiliary parts, together with the
electrode applicator, form a part of the whole radio circuit, mechanical
design of the press is governed by the physical properties
of electromagnetic waves, electromagnetic fields and electric
charges, resonant line lengths, and so on.

The size, strength, and design factors of clamping devices are 
governed by the character of the load and the practicability of 
suiting certain wave lengths and wave patterns to the load. 




Hence the RF engineer and the mechanical engineer must come 
to mutual understanding in the complex design of RF heating 
equipment. 

Comment should be made with regard to preparation of stock 
and the control of moisture contents because RF involves pre- 
cise factors which in turn are best served by precise woodworking 
methods, good workmanship, and control.

Glues which behave best in the RF field are synthetic and are of 
two types: highly water-resistant and fully waterproof. The 
ureas are highly water-resistant and polymerize at relatively low 
temperatures while the melamines, the resorcins, and the phenols 
are waterproof but require higher temperatures to set and result 
in comparatively longer cure-time cycles, other factors being 
equal. 

The amount of pressure required for most fabricated wood 
parts ranges between 50 and 150 psi and may be supplied through 
the medium of air or by hydraulic means. Air pressure is preferred 
because of superior speed and cleanliness as compared to oil- 
pressure systems. 

Other factors remaining the same, dense woods normally ab- 
sorb more energy in a given time than light porous woods. 
Edge-gluing results vary between wood of differing types of 
grain structure, such as vertical or slash, summer or winter, and
so on. These factors sometimes require adjustments in intensity
and disposition of the RF field to avoid pitfalls of too great capac- 
ity variation and gluing results. 

As time elapses, more and more use is being made of RF in 
the wood industry as the need for conservation develops ever- 
increasing economic pressure. 

These needs are being met by the manufacture of RF equipment
and even anticipated with new equipment, one example of
which is a new configuration for setting end-glued finger joints
continuously at high rates of lineal speed.


ECONOMICS


A few of the technical aspects of dielectric high-frequency heat- 
ing have been covered, but each has its specific relation to the 
economics of production in the wood industries. Were the 
savings obtained, or the betterments in production processes in 
doubt, industry would not turn as it has recently to the use of 
this new tool of production. To be specific, a door factory had a 
problem of waste; so great was the trim from ends of dried and 
otherwise good wood, that raw-material costs exceeded that 
which could be borne in competition with lower-cost producers.
Installation of an electronic edge-gluing machine made possible
utilization of the waste product in laminated built-up stiles; 
short pieces between outer bands of solid wood with veneered 
faces. Such were the savings that more doors were produced 
with the same mill input of wood, utilization increased, and an- 
nual savings in wood cost exceeded the value of the electronic 
unit by over ten times. 

Not only do the lower costs of production work out to the finan- 
cial betterment of the user but another factor deserves considera- 
tion. This factor is the ability of a wood-using industry to make 
products not otherwise possible of manufacture. This is illus- 
trated by a producer of vertical-grain, diagonally edge-glued 
piano sounding boards the production of which was impractical 
before application of RF to the problem. Also built around 
electronic edge-gluing is an entire department of a wood-pipe 
and tank manufacturer who found it possible to utilize very low- 
cost wood scraps and end cuttings as raw materials and so re- 
duced cost of production that wood pipe became competitive 
with stainless steel and iron pipe for certain paper, chemical, 
and corrosive-materials use. 

Simple to illustrate is the competitive situation in edge-glued 
lumber panels for producers of TV cabinets, sideboards, kitchen 
cabinets, and myriads of like products; hand-glued batch produc- 
tion was so high in cost that substitutes in the form of low-grade 
plywood were coming to the market in increasing quantities. 
Installation of continuous-process lumber edge gluers with auto- 
matic cutoff saws for the production of these panels has shown 
ability to produce over 25,000 fbm in an 8-hr shift—placing the 
producer right back in the thick of the competitive scrap with 
lower-cost substitute materials, thus not only retaining but ex- 
panding his market at a rate of return more favorable than pre- 
viously he could expect to receive. 

Installation of electronic, continuous-process, veneer edge-gluing
equipment in plywood plants has expanded greatly in
recent years, showing greater utilization of random-width veneers,
fewer rejects to the burners, and increased veneer output per log of
raw material peeled. Elimination of tape in splicing, sanding of
panels, and the ability to patch exterior-grade plywood electronically
have added to the production and lowered the cost of panels for
most of the progressive plywood mills in the California, Oregon,
Washington, and British Columbia region of plywood production.
Low operating and maintenance cost is a factor favoring the 
economics of production through use of dielectric high-frequency 
heating. 




Effect of Ambient and Fuel Pressure 
on Nozzle Spray Angle 


By S. M. DE CORSO¹ AND G. A. KEMENY,¹ E. PITTSBURGH, PA.


Diametral samples across the fuel spray at a distance of 
4'/, in. from the nozzle tip were obtained for ten centrif- 
ugal-type nozzles of 9 to 100-gph capacity, having nominal 
spray angles of 45 and 80 deg. The data were taken over a 
fuel-pressure range of 25 to 400 psi and for ambient pres- 
sures from 0.1 to 8 atm. These diametral spray distribu- 
tions were reduced to equivalent spray-angle values which 
when plotted against ambient and fuel pressure provided 
a summary of the pressure effects on the spray angle. It 
was found that the spray angle decreased markedly with 
increasing fuel and ambient pressure. An explanation of 
the phenomenon is given. The equivalent spray angle 
was found to be a function of the product of fuel pressure
drop and ambient gas density to the 1.6 power, i.e., $\Delta P\,\rho^{1.6}$.


INTRODUCTION 


IT HAS been found that the fuel-nozzle spray angle is an important
factor in the burning of liquid fuels in gas-turbine
combustors. Combustion efficiency, flame length, and
combustor-wall temperatures are all affected by the nozzle-spray
distribution.

This paper deals with the effect of ambient and fuel pressure on 
the spray distribution of centrifugal nozzles. The centrifugal 
fuel nozzle is an important type, widely used in industrial gas 
turbines and in turbojet engines. This type of nozzle is designed 
to give, at atmospheric pressure, a spray of approximately
“hollow-cone” shape which gives optimum atomization for gas-turbine
applications.

A typical design is shown in Fig. 1, together with a spray shape 
obtained at atmospheric pressure. The essential elements of 
the nozzle are the swirl slots, swirl chamber, and outlet orifice. 
The fuel, under pressure, passes through the swirl slots into the 
swirl chamber where a vortex is created. The fuel then issues 
from the outer orifice with a velocity whose axial and tangential 
components determine the spray-cone angle. The spray-cone 
angle is defined as the angle between the tangents to the spray 
envelope at the nozzle. Hereafter in this paper the term 
spray-cone angle will refer to the latter angle. Actually, the 
surface of the spray near the nozzle tip forms a hyperboloid, 
but for all practical purposes, this may be considered a cone at 
low ambient gas pressure. Detailed discussions of the fuel flow in 
nozzles may be found in references (1) and (2).²
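
The dependence of the spray-cone angle on the two velocity components may be illustrated by a simple idealization, offered here as a sketch and not as the authors' method: if each fuel particle leaves the orifice in a straight line, the envelope approaches a cone whose half angle has a tangent equal to the ratio of tangential to axial velocity. The function name and the sample velocities are hypothetical.

```python
import math


def spray_cone_angle_deg(axial_velocity: float, tangential_velocity: float) -> float:
    """Full cone angle in degrees for an idealized swirl spray.

    Assumes straight-line particle paths after the orifice, so that
    tan(half angle) = tangential velocity / axial velocity.
    """
    if axial_velocity <= 0 or tangential_velocity < 0:
        raise ValueError("axial velocity must be positive, tangential non-negative")
    return 2.0 * math.degrees(math.atan2(tangential_velocity, axial_velocity))


# Equal components give a 90-deg cone; more swirl relative to axial flow widens it.
print(f"{spray_cone_angle_deg(10.0, 10.0):.0f} deg")
print(f"{spray_cone_angle_deg(10.0, 10.0 * math.tan(math.radians(40.0))):.0f} deg")
```

The idealization ignores the ambient gas entirely, which is the very influence examined in this paper, so it should be read only as the low-pressure limiting case.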

A particular nozzle is identified by giving the flow rate at a 
certain pressure drop in terms of some fuel type and by giving 
the spray-cone angle. For example, a 60-gph 80-deg nozzle is one 
which will discharge 60 gph of diesel fuel at 100-psi nozzle pressure 
drop and has a spray-cone angle of 80 deg at atmospheric 
pressure. 


1 Westinghouse Electric Corporation. 

2 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Presented at the Gas Turbine Power Division Conference,
Washington, D. C., April 16-18, 1956, of THE AMERICAN SOCIETY OF
MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, January 
16, 1956. Paper No. 56—GTP-3. 


FIG. 1 TYPICAL CENTRIFUGAL NOZZLE
(Labels: nozzle spray-cone angle; fuel spray sheath; orifice; chamber; swirl-slot element.)

Joachim and Beardsley in 1927 (3, 4), and Joachim in 1927 (5), 
have investigated centrifugal sprays in connection with diesel- 
engine injectors. They worked at nozzle pressure drops of 6000 
to 10,000 psi and ambient gas pressures of 200 to 600 psig. These 
pressure levels are of course much higher than those generally in 
use in gas turbines today. From photographs of the sprays, they 
show a decrease in spray angle with increasing ambient gas den- 
sity. D. W. Lee (6) in 1935, in similar experiments on diesel in- 
jectors, also found a decrease in spray angle with increasing gas 
density. In their diesel-injector experiments, the spray outline 
was photographed during the time interval from spray initiation 
to 0.006 sec later. Watson and Clarke (7) in a gas-turbine com- 
bustion study have noted the decrease in spray angle with in- 
creasing gas density, which they illustrated by a photograph. 
Others (1, 8) have noted a slight decrease in spray angle with in- 
creasing nozzle pressure drop at atmospheric pressure. A de- 
crease in angle with increasing air pressure drop was described by 
one of the authors (9) for a swirling annular air jet such as is used 
in connection with an air-assisted fuel nozzle. 

The data presented here cover nozzle pressure drops of 25 to 400 
psi and ambient gas pressures of 1.5 to 114 psia. The nozzles were 
sprayed into quiescent gas at room temperature. The data 
were obtained principally from tests on ten nozzles of commercial 
type which were obtained from a nozzle manufacturer. The 
nozzle capacities (in gallons per hr of diesel fuel at 100-psi 
pressure drop) were 9, 20, 45, 60, and 100 gph with spray angles of 
45 and 80 deg in each capacity, i.e., ten nozzles in all. 

The fuel used in all the spray tests was diesel oil with the follow- 
ing properties at 77 F: Specific gravity, 0.84; surface tension, 
30.7 dynes/cm; kinematic viscosity, 2.43 centistokes. 


DescrIPTION OF APPARATUS AND PROCEDURE 


The test facilities include the high-pressure tank, instrument
panel, and associated fuel pumps and controls. Fig. 2 shows the
18-in-diam high-pressure tank; Fig. 3, the control panel; and
Fig. 4 is a flow diagram showing the major components of the
nozzle-testing installation. As safety precautions, the following
measures were taken: (a) Nitrogen was used as ambient gas for
the fuel spray. (b) The tank with observation windows and
sampling equipment in place was tested hydrostatically at 200 psig.
(The maximum operating pressure anticipated was 100 psig.) (c)
A pressure-relief valve set at 105 psig was provided and the tank
also was equipped with a 6-in. 150-psi blowout diaphragm
which is shown in the top left corner of Fig. 2. If an increase in
tank pressure above 150 psig should occur, the diaphragm would
rupture, and the contents of the tank would discharge through a
6-in. pipe to the outside of the building.

FIG. 2 HIGH-PRESSURE SPRAY TANK

FIG. 3 CONTROL PANEL




Fig. 3 shows the control panel which includes the gages and 
valves required for operating both the new pressure tank and the 
atmospheric-pressure spray system. To avoid confusion during 
tests, the valves were color coded as follows: Red for those used 
only for the high-pressure tank; white for those used only for the 
atmospheric tank; and black for valves which are common to 
both systems. In order to reduce the number of pressure gages 
required, lines were run from pressure-measuring stations to a 
gage manifold. Thus any available gage can be connected to any 
pressure station as required. A few of the gages, however, are per- 
manently connected to pressure-measuring stations. 

Fig. 4 shows the controls which are required for operating the 
two nozzle-testing tanks from a single fuel-pumping system. The 
two tanks each have their own fuel reservoir, and the low-pressure 
pump bypass fuel is always returned to the tank from which fuel
is being pumped; the fuel flowrators meter at the pressure level
existing between the low-pressure and high-pressure pumps. A
pressure-relief valve limits the maximum pressure here to 200 psig
to prevent damage to the flowrator tubes. Because the flowrators
meter fuel at a point upstream of the high-pressure pump, a
water-cooled bypass line from this pump discharges back into the
high-pressure pump inlet. The fuel system is adequate for testing 
nozzles up to 100-gph capacity at pressures up to 400 psi. 

The air system is used for supplying compressed air from a 
house compressor for air-atomizing nozzles sprayed into the 
atmospheric tank. Nitrogen is used for pressurizing the high- 
pressure tank as well as for the atomizing gas when air-atomizing 
nozzles are to be tested at elevated ambient pressures. The nitro- 
gen exhausted from the tank passes through a mist separator 
which can be bled continuously to a fuel container. Flow through 
the exhaust line can be throttled by a diaphragm valve which is 
operated by a manual control located at the control panel. 

The spray-sampling system has 28 radial sampling tubes spaced 
5 deg apart on the arc of a 4 1/2-in-radius circle with the nozzle tip
located at the center of curvature of the arc. The */;.-in-diam 
tubes are drilled out to 0.168 in. diam at the sampling tip. Each 
tube, as seen in Fig. 2, passes through a packing gland to the out- 
side of the tank. The spray-sampling procedure can best be ex- 
plained by looking at a cross section of the brass bar containing 
the reservoirs, as shown in Fig. 4. While spray is being collected, 
the top 28 toggle valves are open, the bottom 28 toggle valves are 
closed, valve 18 H is closed, and the manifold return valve 17 H 
is open. This allows fuel to collect in the 28 separate fuel-collect- 
ing reservoirs, each reservoir being a hole drilled in the block with 
a toggle valve at its inlet and outlet. Each reservoir is also 
connected by a slanting hole to a manifold which, in turn, is con- 
nected to the tank through valve 17 H. This allows fuel to flow 
freely through each sampling tube to its reservoir. After spray- 
sampling has been stopped, the top toggle valves and 17 H are 
closed. All reservoirs are then reduced to atmospheric pressure 
by opening 18 H. At this point, the bottom toggle valves are 
opened and fuel-spray distribution can be read directly from the 
graduated test tubes. 

If any reservoir overflows during the sampling, the correspond- 
ing test tube will be filled above scale and a test of shorter dura- 
tion must then be made. The overflow fuel collects in the bottom 
of the manifold and can be bled out into a small auxiliary fuel con- 
tainer by opening valve 18 H. After readings are taken, all the 
tubes can be drained into the auxiliary fuel container as is being 
done in Fig. 2. Fuel from the auxiliary tank can be pumped back 
into the high-pressure tank using the regular pumping system. 

Sampling is started and stopped by movement of a shutter 
which covers the sampling tubes. The shutter mechanism is 
operated by nitrogen pressure. The timer switch shown in the 
upper right-hand corner of the control panel automatically opens 
the shutter for the time interval desired. 


Fig. 4 Nozzle-Testing Flow Diagram


Fig. 5 Spray-Sampling Tube Arrangement (28 sampling tubes spaced 5 deg apart on an arc centered on the nozzle tip)

It was found that spray-distribution data could be obtained 
very rapidly with this system. For example, a complete set of 
data on one nozzle (20 spray distributions) can be recorded in 
about 3 hr. 

Table 1 gives the orifice diameters of all the nozzles. 


TABLE 1 NOZZLE-ORIFICE DIAMETERS

Capacity, gph          9        20        45        60        100
45-deg angle, in.      ...      0.0585    0.0757    0.0830    0.1046
80-deg angle, in.      ...      0.0554    0.0779    0.0888    0.1162

(Only one 9-gph entry, 0.0315 in., is legible; its row cannot be determined.)


Discussion of Test Results


Fig. 6 Fuel-Distribution Curves (60-gph 80-deg nozzle, fuel $\Delta P$ = 100 psi; cc of fuel versus spray angle, with tank pressure in psia as the parameter)

The spray-cone angle of a fuel nozzle does not give an accurate
description of the spray distribution unless the spray sheath is
very thin, which is generally not the case. Spray distributions
were obtained in these tests by taking fuel samples at points along
an arc as shown in Fig. 5, the samples being proportional to the
fuel flow at the respective sampling points. This produces
spray-distribution curves such as those shown in Fig. 6, where the
ordinates of the curves represent the amount of fuel collected at
the corresponding angular location of the sampling tubes as
indicated on the abscissa. The spray angles referred to in our
sampling data may be considered a convenient measure of fuel
distribution along a circular arc 4 1/2 in. from the nozzle tip.

Two possible sources of error were checked early in our tests: 
(a) At the high fuel and tank pressures considerable mist would 
form in the tank. By observation of the spray shape during 
sampling tests, it was determined that sampling error due to 
the presence of the mist was negligible. (b) Since the nozzle was 
spraying vertically downward, it was necessary to consider whether
gravitational forces were affecting the spray distribution. That 
such was not the case was established by spraying a nozzle hori- 
zontally and noting that the spray remained symmetrical about 
the nozzle axis for at least 9 in. 

The curves in Fig. 6 show the fuel distribution for a 60-gph
80-deg nozzle at 100-psi nozzle $\Delta P$ and various tank pressures.
For each test condition, a distribution curve was obtained for each
of the ten nozzles. This gives an accurate representation of the
fuel distribution at each condition of nozzle and tank pressure. To
describe nozzle performance conveniently in terms of the variables
involved, it was necessary to reduce each curve to a single
numerical value. Of the various methods for doing this, two have
been selected.

The first method involves reducing the individual distribution
curve to a “diametral spray angle,” $\psi$. This spray angle is
calculated by finding the value of $\psi = \psi_L + \psi_R$, where
$\psi_L$ and $\psi_R$ are equal to

$$\frac{\sum y\,\theta\,\Delta\theta}{\sum y\,\Delta\theta}$$

for the left and right lobe of the distribution curve, respectively.
As shown in Fig. 5, $\theta$ is the angular location of a sampling tube
measured from the nozzle axis; $\Delta\theta$ is the angle between adjacent
sampling tubes, i.e., 5 deg; $y$ is the ordinate of the
fuel-distribution curve.
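
As an illustration only, the lobe calculation can be sketched in Python. The sketch assumes that each lobe angle is the flow-weighted mean sampling angle, sum(y θ Δθ)/sum(y Δθ); that form, the function names, and the sample readings are assumptions made for the example and are not taken from the test data.

```python
def lobe_angle(theta_deg, y, dtheta_deg=5.0):
    """Flow-weighted mean angle (deg) of one lobe of a distribution curve.

    Assumed form: sum(y * theta * dtheta) / sum(y * dtheta).
    theta_deg: tube angles measured from the nozzle axis, deg
    y: fuel collected at each tube (any consistent unit, e.g. cc)
    """
    num = sum(yi * ti * dtheta_deg for ti, yi in zip(theta_deg, y))
    den = sum(yi * dtheta_deg for yi in y)
    if den == 0:
        raise ValueError("lobe contains no fuel")
    return num / den


def diametral_spray_angle(theta_left, y_left, theta_right, y_right):
    """psi = psi_L + psi_R, the sum of the two lobe angles (deg)."""
    return lobe_angle(theta_left, y_left) + lobe_angle(theta_right, y_right)


# Hypothetical readings (cc) for three tubes on each side of the axis.
angles = [30.0, 35.0, 40.0]
left = [1.0, 2.0, 1.0]
right = [1.0, 2.0, 1.0]
psi = diametral_spray_angle(angles, left, angles, right)
print(f"diametral spray angle = {psi:.1f} deg")
```

Because the tube spacing is uniform, Δθ cancels in each lobe; it is kept in the sketch only to mirror the assumed expression.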

The second method involves reducing the individual distribution
curve to an “equivalent spray angle,” $\phi$. This angle is calculated
for each distribution curve by finding the value of
$\phi = \phi_L + \phi_R$, where $\phi_L$ and $\phi_R$ are equal to

$$\frac{\sum 2\pi r\,y\,\theta\,\Delta\theta' \sin\theta}{\sum 2\pi r\,y\,\Delta\theta' \sin\theta} = \frac{\sum y\,\theta \sin\theta}{\sum y \sin\theta}$$

for the left and right side of the distribution curve, respectively.
The symbols here are the same as before, with $r$ being the radial
distance from the nozzle tip to the sampling tubes and
$\Delta\theta' = 2$ deg. The value of $\Delta\theta'$ was taken smaller than the
previously used $\Delta\theta$ in order to give better accuracy.
Calculations have shown that $\phi$ differs at most by 1 to 2 deg from
the vertex angle of a cone which contains half of the total fuel
spray at a 4 1/2-in. radius from the nozzle. It is assumed here that
the spray is symmetrical about the nozzle axis. This assumption was
verified in several instances by showing that rotation of the nozzle
about its axis had negligible effect on the spray distribution
recorded. Values of $\phi$ versus nozzle pressure drop, with tank
pressure as the parameter, are shown for the 60-gph 80-deg nozzle in
Fig. 7, while values of $\phi$ versus tank pressure, with nozzle
pressure drop as the parameter for the same nozzle, are shown in
Fig. 8. In Table 2, the values of $\psi$, $\phi$, and $\phi/\phi_0$ are
given for all the nozzles tested together with
the ambient pressure and nozzle pressure drop at which they


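TABLE 2 SPRAY-DISTRIBUTION DATA

Columns: nozzle pressure drop $P$, psi; tank pressure, psia; gas density $\gamma$, lb per cu ft; $P\gamma^{1.6}$; then $\psi$, $\phi$, and $\phi/\phi_0$ for each of the 100, 60, 45, 20, and 9-gph 80-deg nozzles.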
25 1.5 .00732 .00963 83.5 82.5 1.140 81.0 82.0 1.163 80.5 81.7 1.153* 76.0 77.0 1.100 68.5 7 1.135* 
; 100 1.5 .00732 .0385 680.0 77.0 1.061 79.5 79.5 1.129 78.5 79.2 1.120 $82.0 79.7 1.133 69.1 7 1.1533 
225 1.5 .00732 .0866 ---- ---- ----- 79.2 78.2 1.110 77.5 62.1 1.160 682.0 91.7 1.167 61.5 6 1.078 : 
400 21.5 .00732 .1540 ---- ---- ----- 79.5 80.3 1.140 76.5 76.7 1.083 82.0 82.3 1.175 53.9 6 1.005 : 
25 3.58 .01748 .0375 82.7 83.0 1.146 78.5 80.2 1.138 77.3 79.2 1.120% 73.6 74.8 1.060" 66.3 & 1.142 
. 100 3.58 .O17#B .150 75.5 76.2 1.051 77.5 77.7 1.102 77.3 77.8 1.100 79.0 78.2 1.117 65.9 € 1.137 
400 «3.58 ---- ----- 68.5 72.0 1.022 69.2 71.7 1.013 7h.2 78.2 1.117 51.0 5 0.945 
25 7.15 .0349 .1176 82.4 83.0 1.146 75.2 78.2 1.110 73.3 75.8 1.070" 70.0 72.0 1.028" 63.2 6 1.110% 
100 7.15 .O349 -470 75.1 75.0 1.034 73.7 75.5 1.070 74.2 75.5 1.067 73.0 76.0 1.086 61.2 € 1.079 
4OO 807.15 .0349 1.880 ---- ---- ----- 56.2 62.5 .887 58.6 64.0 .90h 62.7 69.8 .997 41.8 5 . 848 
25 14.3 .0698 .350 81.0 81.5 1.125 71.5 76.0 1.080 67.5 71.8 1.015% 64.0 68.5 .978* 58.8 6 1.071* 
100 14.3 .0698 1.40 73.5 72.5 1.000 66.0 70.5 1.000 67.5 70.8 1.000 63.5 70.0 1.000 53.8 6 1.000 
225 14.3 .0698 3.15 wore cee- ----- 57.0 61.0 .865 61.0 62.7 .885 55.0 63.2 .903 40.5 & 810 j 
400 «14.3 .0698 5.60 42.5 89.7 .705 51.7 .730 46.5 53.5 310 & 686 
25 28.6 .1396 1.075 72.0 73.7 1.08 71.5 73.2 1.039 63.5 67.7 .957* 56.2 68.5 979" 52.0 6 1.057* 
100 286.6 4.300 585 61.7 .852 54.2 59.7 51.5 57.8 .816 50.5 55.25 5 929 
225 28.6 .1396 9.68 48.5 56.8 .756 82.0 48.0 .681 36.5 46.9 .663 39.0 47.30 675 31.0 & 664 - 
17.20 39.0 45.2 .623 30.5 37.2 .528 27.5 41.7 .589 29.0 36.1 .515 2 : 49) 
25 57.2 .279 3.25 60.0 63.4 .875 49.0 56.7 .805 500 57.2 54.9 38: 
100 57.2 .279 13.00 42.0 47.2 .651 31.5 38.8 .550 32.5 40.6 .573 30.3 40.7 .581 2 . €80 
225 57.2 .279 29.20 30.5 37.8 .522 23.0 29.7 .b2l 22.5 31.0 .438 20.0 27.0 38% 1 Lhe 
400 57.2 .279 52.00 28.5 36.2 .500 19.0 25.5 .362 17.0 26.7 .377 19.5 2027 .2390 1 
25 11b.3 .558 9.82 48.6 .670 33.5 38.7 .549 30.5 37.7 .533® 26.5 35.0 .500* 2 36.2 .555* 
100 114.3 .558 39.30 33.2 80.0 25.7 «6415.5 21.7 3i0 1 18.0 .297 
225 124.3 .558 88.40 18.0 25.2: .368 18.0 21.5 .305 15.0 20.5 .290 12.5 17.5 .250 q 13.¢ 223 
4OO 114.3 .558_ 157.20 17.5 26.7 27.9 17.7 24.5 19.5 .276 121.5 16.5 .236 
$\phi_0$, deg: 72.5 70.5 70.8 70.0


Fig. 7 Equivalent Spray Angle Versus Nozzle Pressure Drop (60-gph 80-deg nozzle, with tank pressure in psia as the parameter)


were obtained. The $\phi_0$-value is the equivalent spray angle at a
tank pressure of 14.3 psia and nozzle pressure drop of 100 psi.
The values at tank pressures of 3.58 and 7.15 psia were obtained
by interpolation from the $\psi$ or $\phi$ versus tank-pressure curves.
Blank spaces represent points where data could not be obtained 
because of the formation of a fuel bubble or Weber-number effects, 
except for 100-gph nozzles where blanks represent points at which 
fuel flow was limited due to cavitation at the fuel pump. 
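
A minimal Python sketch of the equivalent-spray-angle calculation follows. It weights each reading by sin θ, as in the expression for the lobe angles, and then forms the ratio to the reference angle. The function names and the three-tube readings are hypothetical, and uniformly spaced readings are assumed so that the constant factors 2πr and Δθ' cancel.

```python
import math


def equivalent_lobe_angle(theta_deg, y):
    """sin(theta)-weighted mean angle (deg) of one lobe.

    Evaluates sum(y*theta*sin(theta)) / sum(y*sin(theta)); the constant
    factors 2*pi*r and the angular step cancel for uniform spacing.
    """
    weights = [yi * math.sin(math.radians(ti)) for ti, yi in zip(theta_deg, y)]
    total = sum(weights)
    if total == 0:
        raise ValueError("lobe contains no fuel")
    return sum(w * ti for w, ti in zip(weights, theta_deg)) / total


def equivalent_spray_angle(theta_left, y_left, theta_right, y_right):
    """phi = phi_L + phi_R (deg)."""
    return (equivalent_lobe_angle(theta_left, y_left)
            + equivalent_lobe_angle(theta_right, y_right))


def angle_ratio(phi_deg, phi0_deg):
    """Dimensionless equivalent spray angle phi/phi_0."""
    return phi_deg / phi0_deg


# Hypothetical readings (cc), identical on both sides of the axis.
angles = [30.0, 35.0, 40.0]
readings = [1.0, 2.0, 1.0]
phi = equivalent_spray_angle(angles, readings, angles, readings)
print(f"equivalent spray angle = {phi:.2f} deg")

# Ratio formed from two Table 2 entries (phi = 82.5 deg, phi_0 = 72.5 deg).
print(f"phi/phi_0 = {angle_ratio(82.5, 72.5):.3f}")
```

The sin θ weighting shifts the result toward the outer tubes, so for the same readings the equivalent angle comes out slightly larger than an unweighted mean would.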

Fig. 8 Equivalent Spray Angle Versus Tank Pressure (60-gph 80-deg nozzle, with nozzle pressure drop as the parameter)


It was found that $\phi/\phi_0$, for all the nozzles tested, correlates
well with $P\gamma^{1.6}$, where $P$ = nozzle pressure drop in psi and
$\gamma$ = ambient-gas density in lb per cu ft. Points for which
$\phi_0 We < 8 \times 10^5$ are distinguished in Table 2 by asterisks
and are excluded from the plots of $\phi/\phi_0$ versus $P\gamma^{1.6}$.
Here $\phi_0$ is in degrees and the Weber number is $We = Pd/\sigma$,
with $P$ = nozzle pressure drop, $d$ = nozzle orifice diameter, and
$\sigma$ = surface tension of fuel. For these points, the bubble or
Weber-number effect was predominant in the spray. To bring these
points into good alignment on the curves would require
considerable complication of the expression


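TABLE 2 SPRAY-DISTRIBUTION DATA (continued)

Columns: nozzle pressure drop $P$, psi; tank pressure, psia; gas density $\gamma$, lb per cu ft; $P\gamma^{1.6}$; then $\psi$, $\phi$, and $\phi/\phi_0$ for each of the five 45-deg nozzles.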
25.1.5. .00963 47.0 1.290" 43.2 1.220" 43.0 1.035 28.3 30.5 ---- ---- 
100 60385) 44.5 44.8 1.229 45.0 1.234 45.5 46.2 1.132 40.8 43.0 1.156 38.8 42.8 1.268 
225 1.5 .00732 .0866 bb.o 1.205 45.5 1.248 46.3 47.2 1.160 43.8 4b.5 1.196 41.0 43.8 1.288 
- 25 3.58 .01748 .0375 43.2 45.7 1.253* 41.4 43.2 1.185* 40.5 42.0 1.032" 25.0 29.2 ---- ---- -«--- : 
100 3.58 .0i748 .i50 42.8 43.3 1.187 41.4 43.2 1.185 43.5 45.2 1.110 38.8 41.8 1.122 37.4 40.7 1.197# 
400 6.3.58 .01748 .600 40.8 1.120 41.4 bb.o 1.080 39.7 42.4 1.140 37.2 40.3 1.183 
25 7.15 .0349 40.8 43.2 1.163# 38.8 41.4 1.134# 37.5 39.4 .968" 21.7 27.2 .731® ---- ---- ----- 
100 7.15 .O349 .470 4o.2 41.0 1.123 38.4 40.7 1.117 40.7 bb.O 1.080 34.0 40.0 1.075 34.7 37.8 1.110* 
.0389 «41.8803 ---- ----- 33.3 35.7 .978 35.7 39.4 .968 36.8 36.3 1.030 32.7 36.3 1.068 
25 14.3 .0698 350 37.5 39.6 1.006% 34.9 39.0 1.070" 34.0 36.7 .902* 18.0 26.0 .645# ---- ---- ----- 
100 14.3 0698 1.40 34.8 36.5 1.000 33.8 36.5 1.000 36.2 40.7 1.000 34.5 37.2 1.000 30.3 34.0 1.000% 
225 14.3 0698 3.15 33.2 35.0 .960 30.8 32.8 .899 31.4 36.6 .899 33.3 33.8 .909 30.0 34.0 1.000 
400 14.3 0698 5.60 -- -- 25.2 28.8 .790 27.7 30.6 .752 27.3 31.2 .839 25.4 30.4 .893 : 
25 28.6 1396 L. O75 34.2 35.7 .978% 32.0 36.0 .987* 31.7 35.0 .860" 19.8 26.0 .698# ---- ---- ----- 
100 28.6 .1396 4%&.30 25.5 28.5 .782 26.7 30.3 .831 29.2 32.0 .786 30.0 33.0 .888 23.8 29.2 .856# 
225 28.6 1396 «9.68 21.4 23.3 .638 20.0 24.0 .658 23.0 23.0 .565 21.4 26.8 .667 21.3 26.9 .791 
400 28.6 .1396 17.20 18.5 21.5 .589 16.0 18.5 .507 17.0 19.5 .4&79 16.0 20.8 .559 16.1 21.7 .638 
25 57.2 .279 35.25 25.2 27.5 26.53 26.8 .735* 26.5 27.7 .681* 19.3 23.8 ---- ---- ----- 
: 100 57.2 .279 13.00 16.8 20.6 .565 16.2 19.2 .527 17.0 21.0 .516 16.8 20.2 .543 13.2 16.7 .4ol® 
225 57.2 .279 29.20 14.5 17.0 11.3 16.5 .452 11.7 15.3 .376 12.2 17.2 9.7 13.7 
400 57.2 .279 52.00 11.3 15.5 .425 10.5 16.2 .Wbh 9.5 13.8 .339 9.5 13.7 .368 7.0 10.0 .29% ; 
3 .558 9.82 14.2 17.4 .477* 14.0 16.8 .461* 14.0 17.0 .417# 14.0 16.5 6.0 9.8 .2680 
5 10.0 15.4 .422 8.6 13.0 .357 10.3 15.6 8.6 13.1 .352 7.0 10.8 .316# 
3 .558 68.4 9.2 13.2 .362 7.7 1.5 .315 9.5 13.3 .327 8.0 13.2 .352 6.5 10.53 .303 ‘ 
397.2 7.8 13.5 7.5 13.0 68.5 125 8.4 13.0 7.3 8.1 
$\phi_0$, deg: 36.5 36.5 40.7 37.2 34.0


$P\gamma^{1.6}$. It was therefore considered desirable to disregard
this small number of points and retain the simple expression.
Fig. 9 is a plot of all values of $\phi/\phi_0$ obtained except those
excluded by the $\phi_0 We < 8 \times 10^5$ criterion. Fig. 10
represents a curve drawn through the average values of the
$\phi/\phi_0$ points shown in Fig. 9. From these figures, it can be seen
that the change in $\phi/\phi_0$ is small for $P\gamma^{1.6} < 1$, becomes
larger for $1 < P\gamma^{1.6} < 60$, and begins to decrease for
$P\gamma^{1.6} > 60$.

One explanation of the decrease of spray angle with increasing $P$
which has been advanced (8) assumes that increasing the air density
at the core of the fuel vortex increases the frictional losses
there. This, in turn, increases the ratio of axial to tangential
velocity, thus producing a spray-angle decrease. We believe that
this effect is not an important one for the following reasons:
(a) A nozzle constructed with a solid core at the fuel vortex (i.e.,
no air core) still produced spray-angle reduction with increasing
$P$ and $\gamma$. (b) Photographs of the spray, while showing a
curvature in the spray outline, indicate that the spray-cone angle,
as defined in the Introduction, remains essentially constant. An
example of this can be seen in Figs. 11 and 12, which represent the
spray from a 60-gph 80-deg nozzle at tank pressures of 1.5 psia and
114.3 psia, respectively, with nozzle pressure drop held constant at
100 psi.

(Figure legend: ○ average of spray-angle ratios for 8 to 10 nozzles; ● average of spray-angle ratios for two nozzles.)
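
The correlating parameter and the exclusion criterion can be evaluated with a few lines of Python. This is a sketch only: the function names are hypothetical, the handling of the boundary values 1 and 60 is a choice made here, and the surface tension passed to `weber_number` must be supplied by the user in units consistent with P and d, since no value is quoted in the text.

```python
def correlation_parameter(p_drop_psi, gas_density_lb_ft3):
    """P * gamma**1.6, with P in psi and gamma in lb per cu ft."""
    return p_drop_psi * gas_density_lb_ft3 ** 1.6


def weber_number(p_drop, orifice_diam, surface_tension):
    """We = P*d/sigma as defined in the text (units must be consistent)."""
    return p_drop * orifice_diam / surface_tension


def is_excluded(phi0_deg, weber, limit=8.0e5):
    """True when phi_0 * We falls below the 8 x 10^5 criterion."""
    return phi0_deg * weber < limit


def regime(parameter):
    """Qualitative trend of phi/phi_0 reported for Figs. 9 and 10.

    The assignment of the exact boundary values is an assumption.
    """
    if parameter < 1.0:
        return "small change"
    if parameter <= 60.0:
        return "larger change"
    return "change begins to decrease"


# Densities read from Table 2: 0.00732 at 1.5 psia, 0.0698 at 14.3 psia.
for p, gamma in [(25.0, 0.00732), (100.0, 0.0698), (400.0, 0.558)]:
    x = correlation_parameter(p, gamma)
    print(f"P = {p:5.0f} psi, gamma = {gamma:.5f}: "
          f"P*gamma^1.6 = {x:.4g} ({regime(x)})")
```

The first two computed values agree with the tabulated 0.00963 and 1.40 to within about one per cent.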

From our experiments, the indications are that the phenomenon
of decreasing spray angle is caused by aerodynamic effects due to
the motion of the fuel spray through the ambient gas. The fuel
emerging from the nozzle at high velocity entrains gas at the
inner and outer surfaces of the spray sheath. (By spray sheath
we mean an imaginary surface which encloses the fuel spray in
space.) However, the gas supply to the inner portion of the spray
sheath is limited by the enclosed volume in the sheath, as shown
in Fig. 16. Of course, for $\phi$ = 180 deg, the gas-flow path would be
equally unrestricted for both sides of the spray sheath; but we
are dealing here with the more common type of nozzles where $\phi$ is
generally less than 90 deg. Pressure differences resulting from
this effect ($P_3 - P_1$ in Fig. 16) have been measured previously by
one of the authors for swirling annular air jets used on air-assisted
fuel nozzles (9). Pressure-difference data were obtained in the
current tests for a 45-gph 80-deg nozzle and are shown plotted in
Figs. 13, 14, and 15. Figs. 13 and 14 are plots of spray-sheath
pressure difference versus tank pressure and versus nozzle
pressure drop, respectively. Fig. 15 presents equivalent spray angle
versus spray-sheath pressure difference with tank pressure and
nozzle pressure drop as parameters.


Fig. 10 Average Curve, $\phi/\phi_0$ Versus $P\gamma^{1.6}$

In some instances, it was possible to observe droplets being 
carried upward toward the nozzle along the nozzle axis by the re- 
verse gas flow established. In addition, it was found that supply- 
ing gas under pressure at the nozzle axis by means of a tube in- 
serted up inside the spray sheath produced an increase in spray 
angle. Thus we see that a low-pressure region exists inside the 
spray sheath and it appears that this is instrumental in caus- 
ing the spray-angle decrease. The mechanism by which this 
pressure difference causes a decrease in spray angle is proposed 
to be as follows. 

Where the spray sheath exists as a liquid sheet, the pressure
difference acts directly on this sheet. The distance from the nozzle
orifice for which this sheet persists may be very small at the higher
values of $P$ and $\gamma$. In the region where the fuel spray consists
of droplets, Fig. 16 shows the static-pressure condition existing in
a plane perpendicular to the nozzle axis, with $P_2 < P_1 < P_3$, where
$P_2$ represents a mean pressure in the combined fuel droplet-air
stream, $P_1$ is a mean pressure inside the spray sheath, and $P_3$ is
the ambient pressure. The arrows indicate the probable directions
of gas flow. In Figs. 13, 14, and 15, the spray-sheath pressure
difference refers to $P_3 - P_1$, where $P_1$ is measured on the
nozzle axis, 1/2 in. from the nozzle tip. The pressure $P_2$ was not
actually measured but must be taken to be less than $P_1$ and $P_3$ if
gas is to be entrained from regions 1 and 3. Thus, in the region
where the fuel breaks up into droplets, the spray sheath may be
considered to be droplets and entrained gas. The pressure
difference $P_3 - P_1$ sets up air flows which produce droplet
acceleration toward the nozzle axis.


Fig. 9 Individual Data Points, $\phi/\phi_0$ Versus $P\gamma^{1.6}$

Fig. 11 Fuel Spray at Low Tank Pressure (60/80 Deg at 1.5 Psia and 100 Psi $\Delta P$)

Fig. 12 Fuel Spray at High Tank Pressure (60/80 Deg at 114.3 Psia and 100 Psi $\Delta P$)

According to the preceding analysis, if a nozzle were constructed
having a spray-cone angle of 180 deg, there should be
no reduction in spray angle since the volumes inside and outside
the spray sheath are equal. Thus one would expect the effect of
$P$ and $\gamma$ on spray angle to decrease as the spray-cone angle
increases. One might ask, then, how the $\phi/\phi_0$ versus
$P\gamma^{1.6}$ curves for both the 45 and 80-deg nozzles fall so close
together. This may be visualized as the result of two opposing
effects. While the pressure drop generated across the spray sheath
increases with decreasing cone angle, the space available for a
change in $\phi$ is correspondingly decreased. Thus, when the results
are plotted in the form of a ratio $\phi/\phi_0$, these effects tend to
cancel, giving essentially one curve on the $\phi/\phi_0$ versus
$P\gamma^{1.6}$ plot.


Fig. 13 Spray-Sheath Pressure Difference Versus Tank Pressure (45-gph 80-deg nozzle)

Fig. 14 Spray-Sheath Pressure Difference Versus Fuel Pressure $\Delta P$ (45-gph 80-deg nozzle)

Fig. 15 Equivalent Spray Angle Versus Spray-Sheath Pressure Difference (45-gph 80-deg nozzle; pressure difference in inches of water; tank pressures of 1.5, 14.3, 28.6, 57.2, and 114.3 psia)

Fig. 16 Spray-Sheath Pressure Differences (arrows indicate probable path of air flow)

From photographs and observations of the spray through the 
windows in the tank, it was concluded that the drop size of the 
spray decreased with increasing tank pressure and nozzle pressure 
drop. An example of the difference in spray-droplet appearance 
can be seen by comparing Figs. 11 and 12 where the tank pressure 
increased from 1.5 to 114.3 psia with the nozzle pressure drop con- 
stant at 100 psi. No actual drop-size measurements were made. 

Since the aerodynamic effects are so pronounced and are related 
to the pressure difference which is produced between the inside 
and outside of the spray sheath, it was interesting to find the effect 
of enclosing the fuel spray with a combustor shape. This was 
done using a typical combustor shape consisting of a hemisphere 
followed by two cylindrical sections all of 8 in. diam. Holes were 
put in the hemisphere to simulate the normal open area at that 
location. Spray distributions taken with a 60-gph 80-deg nozzle 
over the full range of conditions were found to be identical to 
those obtained without the combustor enclosure. 


Summary of Test Results


Fig. 6 shows individual fuel-spray distribution curves. In
order to represent these curves in a concise manner, a
characteristic angle must be defined which is a representation of
these distribution curves. It was found that the equivalent spray
angle $\phi$, as defined, provided a convenient means of representing
a given fuel distribution. Calculations have shown that the angle
$\phi$ is nearly equal to the vertex angle of a cone which contains
half of the total fuel spray at a 4 1/2-in. radius from the nozzle.
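
The half-flow cone angle can be estimated from a sampled distribution as sketched below in Python. The sketch assumes a spray symmetrical about the nozzle axis, treats each reading as a band of width equal to the tube spacing with the fuel spread uniformly across it, and weights each band by sin θ to account for the ring area it represents. These modeling choices, the function name, and the readings are assumptions made for illustration.

```python
import math


def half_flow_cone_angle(theta_deg, y, spacing_deg=5.0):
    """Vertex angle (deg) of the cone about the axis holding half the flow.

    theta_deg: tube angles from the nozzle axis, ascending, deg
    y: fuel collected at each tube (any consistent unit)
    spacing_deg: angular spacing of the tubes, deg
    """
    weights = [yi * math.sin(math.radians(ti)) for ti, yi in zip(theta_deg, y)]
    total = sum(weights)
    if total <= 0:
        raise ValueError("no fuel collected")
    half = 0.5 * total
    running = 0.0
    for ti, wi in zip(theta_deg, weights):
        if wi > 0 and running + wi >= half:
            # Interpolate linearly inside the band that crosses the half mark.
            frac = (half - running) / wi
            theta_half = ti - 0.5 * spacing_deg + frac * spacing_deg
            return 2.0 * theta_half
        running += wi
    # Fallback for rounding: the outer edge of the last band.
    return 2.0 * (theta_deg[-1] + 0.5 * spacing_deg)


# Hypothetical readings (cc) on one side of a symmetrical spray.
angles = [30.0, 35.0, 40.0]
readings = [1.0, 2.0, 1.0]
cone = half_flow_cone_angle(angles, readings)
print(f"half-flow cone angle = {cone:.2f} deg")
```

For a narrow, single-peaked distribution such as this one, the result lies close to the sin θ-weighted mean angle, which is the behavior the comparison with $\phi$ relies on.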

The ratio $\phi/\phi_0$, where $\phi_0$ is the equivalent spray angle at
atmospheric pressure and 100-psi nozzle pressure drop, was found
to correlate with $P\gamma^{1.6}$, where $P$ is the nozzle pressure drop
in psi and $\gamma$ is the ambient gas density in lb per cu ft. Fig. 9
is a plot of values of $\phi/\phi_0$ versus $P\gamma^{1.6}$ and Fig. 10
represents a curve drawn through the average values. From these
figures, it can be seen that $\phi/\phi_0$ decreases when the parameter
$P\gamma^{1.6}$ is increased. The greatest rate of decrease occurs for
$1 < P\gamma^{1.6} < 60$.

The static pressure difference between ambient and a point 
inside the spray sheath was measured for all test points 
with a 45-gph 80-deg nozzle. For all cases, it was found 
that the pressure inside the spray sheath was lower than the
ambient pressure. This pressure difference increased with
increasing tank pressure and nozzle pressure drop and appears to 
be instrumental in causing the spray-angle decrease. An ex- 
planation of the spray-angle decrease is given based on the 
aerodynamic effects caused by motion of the fuel spray in 
the ambient gas. 

Spray distributions taken with the nozzle surrounded by a typi- 
cal combustor upstream section were found to be identical with 
those taken without the combustor enclosure. 

Of course, the spray distributions as obtained here will be
modified by the combustor air flow during actual burning; to 
what extent is not known precisely. From past experience in 
combustion tests, we know that a change from an 80 to 60-deg or 
60 to 45-deg nozzle results in considerable variations in combus- 
tion efficiency, flame length, and combustor-wall temperatures. 
From this, we may surmise that the equivalent spray angle is an 
important factor and that the variation in equivalent spray angle 
must be considered in cases of combustor scaling or new designs 
where differences in fuel or ambient gas pressures will be en- 
countered. An example of such a case would occur in attempting 
to predict the performance of a combustor at operating pressures 
of the order of 6 atm from combustion tests at atmospheric 
pressure. 
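
To illustrate the scaling problem, the sketch below compares the correlating parameter $P\gamma^{1.6}$ at atmospheric pressure and at 6 atm for the same nozzle pressure drop. It assumes that the gas density is proportional to absolute pressure (same gas and temperature), which is an assumption of this example and not a result of the tests. The function name is hypothetical, and the atmospheric density of 0.0698 lb per cu ft is the Table 2 value at 14.3 psia.

```python
def scaled_parameter(p_drop_psi, density_ref_lb_ft3, pressure_ratio):
    """P * gamma**1.6 with gamma scaled in proportion to absolute pressure.

    pressure_ratio: operating pressure divided by the reference pressure
    at which density_ref_lb_ft3 applies (assumed isothermal, same gas).
    """
    density = density_ref_lb_ft3 * pressure_ratio
    return p_drop_psi * density ** 1.6


P_DROP = 100.0       # nozzle pressure drop, psi
GAMMA_ATM = 0.0698   # lb per cu ft at 14.3 psia (Table 2)

at_atmospheric = scaled_parameter(P_DROP, GAMMA_ATM, 1.0)
at_six_atm = scaled_parameter(P_DROP, GAMMA_ATM, 6.0)

print(f"atmospheric: P*gamma^1.6 = {at_atmospheric:.2f}")
print(f"6 atm:       P*gamma^1.6 = {at_six_atm:.1f}")
print(f"increase factor = {at_six_atm / at_atmospheric:.1f}")
```

Under these assumptions the parameter rises by the factor 6^1.6, and both values fall in the range $1 < P\gamma^{1.6} < 60$ where the equivalent spray angle changes most rapidly, so a nozzle selected from atmospheric tests would be expected to spray at a noticeably smaller equivalent angle at 6 atm.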


ACKNOWLEDGMENTS 


The guidance of Dr. A. E. Hershey during the course of this 
project is appreciated. We also wish to acknowledge the as- 
sistance of Mr. A. T. Pieczynski who helped in compiling data and 
took the spray photographs. 


BIBLIOGRAPHY 


1 “The Design of Constant and Variable-Capacity Mechanical
Oil Atomizers,” by J. F. Harvey and A. S. Hermandorfer, Trans.
SNAME, vol. 51, 1943, pp. 61-82.

2 “The Atomization of Liquid Fuels,” by E. Giffen and A.
Muraszew, Chapman and Hall Ltd., London, England, chapter 4,
1953.

3 “Factors in the Design of Centrifugal Type Injection Valves for
Oil Engines,” by W. F. Joachim and E. G. Beardsley, NACA Report
No. 268, 1927.

4 “The Effects of Fuel and Cylinder Gas Densities on the
Characteristics of Fuel Sprays for Oil Engines,” by W. F. Joachim and
E. G. Beardsley, NACA Report No. 281, 1927.

5 “Oil-Spray Investigations of the N.A.C.A.,” by W. F. Joachim,
Trans. ASME, vol. 49-50, Paper OGP-50-6, 1927-1928.

6 “A Comparison of Fuel Sprays From Several Types of
Injection Nozzles,” by D. W. Lee, NACA Report No. 520, 1935.

7 “Combustion and Combustion Equipment for Aero Gas
Turbines,” by E. A. Watson and J. S. Clarke, Journal of the
Institute of Fuel, vol. 21, no. 116, Oct., 1947, pp. 1-34.

8 “The Atomization of Liquid Fuels for Combustion,” by J. R.
Joyce, Journal of the Institute of Fuel, vol. 22, 1949, pp. 150-156.

9 “Air-Assisted-Fuel Nozzle Development,” by S. M. DeCorso,
Westinghouse Research Laboratories, Pittsburgh, Pa., Research
Report R-94451-1-C, 1954.


Discussion 


H. Clare.* A most interesting feature of this paper is the
novel method of assessing spray-cone angle, which is shown to
vary with applied fuel pressure even though a silhouette of the
spray cone would appear to be sensibly constant. The
phenomenon of the change in cone angle with ambient pressure is
also well illustrated. However, the hypothesis would be more
conclusive if results were included to show the effect of ambient
pressure on the flow number of the nozzle. If the effect of
high-density gas at the air core of the nozzle is to increase friction
losses in the fuel vortex, and hence the ratio of the axial and
tangential velocities, it is reasonable to expect that any reduction in
spray-cone angle due to this mechanism would be accompanied
by an increase in the discharge coefficient.


*Chemical Physics Dept., Ministry of Supply, National Gas Turbine
Establishment, Pyestock, Farnborough, Hants, England.


Authors’ Closure


While we are in general agreement with Mr. Clare’s remarks, 
we note that the definition of the term “spray-cone angle’’ as 
used in his discussion evidently differs from that given in the 
introduction of the paper. Thus, we would say that the spray- 
cone angle remained essentially constant while the spray angle 
(as measured at the sampling tubes) varied. Concerning the 
flow-number variation, an interesting point is raised which might 
well have been covered in the paper. As Mr. Clare points out, 
any increase in frictional losses at the fuel vortex should cause an 
increase in the flow number. Examination of the flow-rate data 
(not given in the paper) showed no increase in flow number with 
increasing ambient pressure, indicating that the increase of 
frictional loss at the fuel vortex is negligible. 

The following additional test results were presented orally by 
Mr. DeCorso at the Gas Turbine Power Conference because it 
was felt that the scope of application of the data was increased 
through this additional information. We have, therefore, 
included these additional results in the discussion. 

Fig. 10 of the paper gives the relation between $\phi/\phi_0$, the
dimensionless equivalent spray angle, and $P\gamma^{1.6}$. Data for this
curve were obtained from ten nozzles of different capacities but all
having spray-cone angles of either 45 or 80 deg. These nozzles
were all manufactured by the same company. To increase the
usefulness of this curve, it obviously would be desirable to
determine whether it applied for nozzles made by other manufacturers
and nozzles of different spray-cone angles. Fig. 17 of this closure
is a plot of this same curve with the additional data points
representing nozzles of different manufacturers and additional spray
angles of 30, 60, and 100 deg. It can be seen that only the points
for the 60/100-deg nozzle are significantly displaced from the
curve. As the spray-cone angle increases toward 180 deg, there
should be less reduction in spray angle with changes in pressure
and density because the aerodynamic effects inside and outside
the spray sheath become more and more similar. The 60/60-deg
pintle nozzle was manufactured in our shop. This nozzle
without any fuel swirl showed spray-angle variations similar to
those encountered with the other nozzles. These results indicate
that the curve predicts the spray-angle changes for nozzles
having listed spray-cone angles varying from 30 to 80 deg. This is
the range of spray-cone angles generally encountered in
fuel-oil-burning gas turbines and many other combustion installations.
From the observation that nozzles by different manufacturers
showed the same variation of spray angle with changes in nozzle
pressure and ambient gas density, it can only be concluded again
that these spray-angle changes are primarily produced by
aerodynamic effects acting on the spray after it has left the nozzle tip
and that variations in the internal geometry of the nozzle do not
affect results significantly.


Fig. 17 Average Curve and Data ($\phi/\phi_0$ versus $P\gamma^{1.6}$; average curve for 10 nozzles, with points for 17-gph 60-deg, 32-gph 30-deg, 60-gph 100-deg, 60-gph 80-deg (two manufacturers), and 60-gph 60-deg pintle nozzles)


An Experimental Arrangement for the 
Measurement of the Pressure 
Distribution on High-Speed 
Rotating Blade Rows 


By K. LEIST,! AACHEN, GERMANY 


For several years past the research staff of the Institute 
for Turbomachines of the Aachen Technical University has 
carried out measurements on rotating turbine blading. 
This program is part of a comprehensive effort directed 
toward the experimental investigation of the three-dimensional
flow through axial-flow turbomachines.


INTRODUCTION 


MEASUREMENTS on rotating blades are of interest because
the effect of rotation of the row, and of the
centrifugal forces arising therefrom, upon the flow of
the working medium so far has been explored but slightly. It
scarcely can be expected that this very complicated flow can be
determined precisely in its entirety. However, it is intended to
gather a maximum of significant information from measurements
on coaxial cylindrical sections. This information may not be
adequate as a basis for the establishment of a generally valid
method of analytical treatment of the flow because of the elusive
nature of the interaction of numerous contributing influences.
Nevertheless, the results of such measurements should
be an important contribution to the three-dimensional theory of
axial-flow turbomachines.


EXPERIMENTAL SETUP 


To start with we shall discuss the experimental arrangement. 
It was developed for the purpose of measuring the pressure dis- 
tribution over the surface of the blades of rotating axial or radial- 
flow-turbine or compressor-blade rows. The development and 
construction of this experimental arrangement was undertaken 
and carried to completion at the Aachen Technical University. 

Mechanisms for this purpose have been described previously in the
literature by, e.g., Fuhrmann, Betz and Mautz, Keller and
Bleuler, Weske, Runckel and Davey, Himmelskamp, Morelli and
Bowerman. Apart from the fact that all except Keller’s investigation
pertain to retarding flow, that is, compressor blading, the
speeds of the previous devices are about 2000 rpm or less, and
the peripheral speed exceeds 100 m/sec only in the Runckel and
Davey investigation, where it is 117 m/sec.

1 Professor, Technical University.

Presented at the Gas Turbine Power Division Conference,
Washington, D. C., April 16-18, 1956, of The American Society of
Mechanical Engineers.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters,
March 15, 1956. Paper No. 56-GTP-13.

In the arrangement at Aachen University, peripheral velocities
of 170 m/sec are produced with speeds of about 10,000 rpm.
Consequently the diameters of the rotor are so small that the
centrifugal accelerations, the effect of which upon pressure
distribution is one of the chief objectives of our investigation,
are increased sevenfold relative to the others. So we hope that
the influence of the centrifugal force upon the flow will be
rendered as precisely as possible. Measurements in the wake of
rotor blades by means of a rotating probe were made by Weske who
reported important results.
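
The "sevenfold" figure can be checked roughly from the speeds quoted above. The following is only a sketch: it assumes that the 117 m/sec of the Runckel and Davey rig was reached at about 2000 rpm (the text gives only "about 2000 rpm and less" for the earlier devices), and the function name is illustrative. Centrifugal acceleration at the periphery is a = u²/r = u·ω.

```python
import math


def centrifugal_acceleration(u_m_per_s: float, rpm: float) -> float:
    """Acceleration at the periphery, a = u * omega (equivalent to u**2 / r)."""
    omega = 2.0 * math.pi * rpm / 60.0  # angular velocity, rad/s
    return u_m_per_s * omega


# Values quoted in the text for the Aachen arrangement.
a_aachen = centrifugal_acceleration(170.0, 10_000.0)

# ASSUMPTION: the 117 m/sec peripheral speed is paired with 2000 rpm.
a_earlier = centrifugal_acceleration(117.0, 2_000.0)

ratio = a_aachen / a_earlier
print(f"Aachen rig:   {a_aachen:9.0f} m/s^2")
print(f"Earlier rig:  {a_earlier:9.0f} m/s^2")
print(f"Ratio:        {ratio:.2f}")
```

Under that assumption the ratio comes out slightly above 7, which agrees with the statement in the text.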

The first version of the test setup at Aachen University, developed
by W. Fister,² which already had solved some questions
of the problem, required several important improvements, some 
of them of a practical nature. It was found necessary, for in- 
stance, to reduce wear of instrumental components to insure 
reliable operation over extended periods. 

On the basis of this experience an improved experimental ar- 
rangement was developed by W. Dettmering. This second 
setup has given satisfactory service for about 250 hr of operation 
with exactly reproducible results at various operating conditions 
and speeds. It was used for a comprehensive program of sys- 
tematic measurements. 

The principal differences to be expected between the flow past 
fixed straight grids and the flow in normal operation of a turbo- 
machine past rotating blades are as follows: 


(a) Effect of variation of pitch along the radius because of the 
fanning of the blades in the rotor wheel. This effect can be de- 
termined from investigations of the fixed cylindrical grid. 

(b) Centrifugal effects in the boundary layer along the wall 
of the blades and the radial secondary flow arising from it. 

(c) Variation of pressure along the radius ahead of the rotor 
blades and the radial-flow component arising therefrom, especially 
for long blades. 

(d) Tip leakage through the clearance between rotor blade 
and casing and its influence upon pressure distribution near the 
tips of the blades. 

(e) Wake flow behind the stator blades. 

(f) Variation, along the length of the blades, of the direction 
of flow at the leading edges. 


The second experimental arrangement for the measurement on 
rotating blading is shown in Fig. 1. The turbine is coupled to a 
hydraulic torque dynamometer to determine its output. Fig. 2 
shows the over-all arrangement of the entire unit in a test cell. 
The cold-air flow through the turbine is induced by suction by 
means of a compressor. Much care has been taken to achieve 
satisfactory axially symmetrical inflow at the stator wheel; 
all disturbing influences have been eliminated. The stator row 
may be adjusted circumferentially with center in the shaft axis 
by a manually operated mechanism. The essential parts of the 
instrumental equipment are described with reference to Fig. 3 as 
follows: 


2 "Druckverteilungsmessungen an umlaufenden Turbinenschaufeln,"
by W. Fister, VDI-Forschungsheft 448, 1955.


Fig. 1 Turbine Test Stand for Measurement of Pressure Distribution on Rotating Blades. Maximum Speed n = 13,000 rpm

Fig. 2 View of Experimental Unit in Test Chamber

Fig. 3 Schematic Drawing of Measuring Device

Fig. 4 Plastic Blade With Pressure Holes in Seven Planes of Measurement


The testing rotor blade made of a fusible 
plastic (Araldit Giessharz B) and glued into 
an aluminum ring was provided with 20 
cast-in pressure conduits of 1 mm diam. 
Connections to each of the conduits of 0.4 
mm diam were drilled normal to the surface 
of the blade. This blade was tested up to 
the breaking speed of 17,000 rpm. There 
are 20 such holes distributed over the front 
and back of the profile in each surface of 
measurement, Fig. 4. Altogether seven 
surfaces of measurement are provided at 
various radii from tip to root of the blade; 
that is, 140 holes on one blade. The loca- 
tions of the measuring points are shown in 
Fig. 5. The pressure distribution in only 
one of these surfaces can be measured in 
any one test run, the pressure holes of the 
remaining six being closed off with thin 
adhesive plastic tape (Fig. 7). 


Fig. 5 Distribution of Pressure Holes Around Blade

Fig. 6 Assembly With Blade for Measurements


A test wheel is shown in Fig. 6. The 20 pressure leads are con- 
nected to terminals on a disk rotating with the wheel and then 
transmitted by a selector mechanism to a disk rotatable relative
to the wheel. This disk is operated by an electric motor placed 
within the turbine rotor. The connecting sliding surfaces of the 
selector mechanism are designed carefully to prevent any leak- 
age. In this way the pressures are transmitted without error 
from the rotating turbine shaft to the multi-U-tube manometer 
arranged outside. The rotating pressure seal, connecting the 
rotating and fixed parts of the leads, is kept to minimum diameter 
and sealed by two packing rings with intermediate oil chamber. 
Thereby friction and heat generation are kept to a minimum. 
Provision is made also for water cooling, yet this was found 
necessary only for the longest periods of measurement. 

This type of seal between the rotating and fixed parts has given
excellent service in many hours of operation. Pressures transmitted
by the seal are successively applied to one of the 20 U-tubes
by a distributor and selector disk of similar design to that
in the turbine. In this way a complete representation of the
pressure distribution is produced which then may be photographed.
A further merit of this method lies in the fact that it
provides an automatic check against leakage, as the pressure
reading at the time of its measurement may be compared with
the pressure read after a considerable time, e.g., one night, has
elapsed. This check has been carried out frequently.

Fig. 7 View of Blade With Plastic Film Covering Pressure Holes Partly Removed

Fig. 8 Drive Motor for Distributor Mechanism


OPERATION 


As shown in Fig. 8, a second selector disk operated by a posi- 
tively synchronous motor serves to distribute the pressures suc- 
cessively among the various pressure leads. 

It is important that the two selector disks be synchronized in
order that a particular pressure hole be co-ordinated with the
corresponding manometer column. Available synchronous motors
could not be selected for this purpose as their torque was
insufficient. Instead, two d-c motors operated by relay control
were chosen for this drive. The selector disks are rotated slowly
by them through a reduction-gear train of 1:5000 gear ratio.



The operation of this mechanism is checked by a position in- 
dicator. For test stand II, an arrangement of electric bulbs was 
used indicating the position of the two selector disks. Current 
supply to the electric motor, initially by way of brushes and 
rings, now is admitted to the rotating system through standard 
ball bearings. This is a very satisfactory solution. The transi- 
tion resistance which, in the case of brushes, rises sharply at high 
speeds, in the case of ball bearings remains essentially constant 
independent of speed. Consequently the voltage once chosen 
may be kept the same for all speeds. 

As both selector disks, by the action of their respective motors, 
are rotated to the next bore of the distributor, a silver contact pin 
fixed on the selector disk and rotating with the disk touches a 
contact bar and, closing the circuit, stops the rotation of the 
motor by way of a relay. Each motor is stopped separately 
after the next bore has been reached. An electrically controlled 
friction brake serves to arrest the selector disk at precisely the 
desired position. 

After a certain time interval controlled by the charging of a 
condenser, the length of which is chosen such that equilibrium 
pressure of the manometer liquid column is attained, the motors 
receive a new impulse and the next cycle begins. The duration 
of this impulse is freely adjustable and it must suffice to advance 
the selector disks beyond the pressure bores and to separate the 
contact pin and bar. Yet it must not be so great that the next 
bore is passed before the breaker relay has a chance to get into 
action. This completely automatic and widely adjustable selec- 
tor mechanism for the 20 pressure taps of a profile section has 
proved to be very satisfactory. 
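
The timing constraint on the motor impulse described above can be put as a simple inequality. The sketch below is one interpretation, not the circuit actually used: it assumes the impulse must last at least long enough for the contact pin to leave the bar, and must end, with a margin for the relay, before the pin reaches the next contact. All class and field names and all numerical values are hypothetical; the paper gives no timings.

```python
from dataclasses import dataclass


@dataclass
class SelectorTiming:
    """Hypothetical timing model of one selector disk (all values assumed)."""

    step_time_s: float           # travel time from one bore to the next
    contact_clear_time_s: float  # time for the contact pin to leave the bar
    relay_delay_s: float         # time the breaker relay needs to act
    dwell_time_s: float          # condenser-controlled settling interval
    n_taps: int = 20             # pressure taps per profile section (from the text)

    def pulse_window(self) -> tuple[float, float]:
        """Admissible range of impulse duration (lower, upper) in seconds."""
        lower = self.contact_clear_time_s
        upper = self.step_time_s - self.relay_delay_s
        if upper <= lower:
            raise ValueError("no admissible impulse duration for these timings")
        return lower, upper

    def scan_time_s(self) -> float:
        """Time to step through all taps, one dwell interval per tap."""
        return self.n_taps * (self.step_time_s + self.dwell_time_s)


# Illustrative numbers only.
timing = SelectorTiming(step_time_s=3.0, contact_clear_time_s=0.4,
                        relay_delay_s=0.1, dwell_time_s=10.0)
lower, upper = timing.pulse_window()
print(f"impulse must last between {lower:.1f} s and {upper:.1f} s")
print(f"one complete scan of {timing.n_taps} taps takes {timing.scan_time_s():.0f} s")
```

The dwell interval dominates the scan time, which is consistent with the remark that it is chosen so that the manometer columns reach equilibrium.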

The control desk with signal bulbs and indicating instruments 
for selector-disk operation is shown in Fig. 9. In the background 
is the test cell with the test stand. 


Fig. 9 View of Control Panels in Operating Stand


The details of design of the pressure distributor arranged in the 
hollow shaft of the turbine rotor can be seen in Figs. 10 and 
10(a). The apparatus is relatively extensive and complicated.
This is a necessary consequence of the requirements: short
reading periods for the systematic measurement of sequences of
indications, high reliability of operation, and great accuracy.
All this has been achieved successfully, as is shown by the
exact reproducibility of any reading. Simultaneously with the
pressure-distribution measurements, readings of the inflow and
outflow conditions for the rotating blading are taken by means
of flow-measuring probes.

The effect of centrifugal forces upon the fluid in the pressure
leads was taken care of by a correction factor K defined as follows

$$P_n = P_0 K$$

where $P_0$ is the pressure reading on the manometer board connected
to the pressure lead from the seal on the center line of the shaft,
and $P_n$ is the pressure at the point of measurement, both with
respect to atmosphere.

To determine the correction factor K, one of several assump- 
tions can be made 


1 Constant density of the air in the rotating tube. This as- 
sumption is frequently made but it leads in some cases to con- 
siderable errors. 

2 Constant temperature of the air in the rotating tube. 

3 Polytropic temperature and pressure distribution in the 
rotating tube. 


Since the assumption of constancy of air temperature through- 
out the rotating tube appears to come close to actual conditions 
it was used for computing K, particularly since it can be shown 
that slight variations of the temperature have little influence 
upon K; a change of 10 deg of the temperature assumed con- 
stant caused a 0.5 per cent change in the value of K. 

By isothermal relations the correction factor is a function of
the radii of the point of measurement $r$ and of the point of
transition from the rotating to the stationary lead $r_0$, the speed
of rotation $n = 60\,\omega/2\pi$, and the temperature $T$

$$K(r, n, T) = \frac{P_n}{P_0} = \exp\left[\frac{\omega^2 (r^2 - r_0^2)}{2 g H_0}\right]$$

where $H_0 = R T_0 = P_0/\gamma_0$ pertains to conditions in the fixed
pressure leads ($r = 0$). In the present case the pressure seal was
on the center line of the shaft, hence $r_0 = 0$. Consequently

$$K(r, n, T) = \frac{P_n}{P_0} = \exp\left[\frac{\omega^2 r^2}{2 g H_0}\right]$$

The influence of variations of barometric pressure is eliminated
by reducing the corrected pressures to nondimensional form
$(P_n - P_1)/(\tfrac{\rho}{2} w_1^2)$.
For purposes of comparison, the same curve of pressure distribution
has been shown in Fig. 11, first without any correction,
next as corrected on the assumption (1) of constant density
($\gamma$ = const), and finally on the assumption (2) of constant
temperature ($T$ = const). An experimental check was made of the
validity of the correction factor K by rotating a Pitot tube which
measured the total pressure of the peripheral velocity. The reading
when corrected by the foregoing factor was in close agreement with
the total pressure $P_{\text{stat}} + \tfrac{\rho}{2} u^2$ at the
point of measurement. A further check was made by filling the leads
with different gases, e.g., air and hydrogen. The different
readings, when corrected, resulted in the same value for the
pressure at the point of measurement.
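
As an illustration of the correction, the sketch below evaluates K for the isothermal and the constant-density assumptions and checks the stated sensitivity to temperature. It is written in SI units, where the exponent becomes ω²r²/(2RT) with R the specific gas constant; absolute pressures are assumed, and the radius (the pitch circle of the unit described later, 283 mm diam), the speed, the temperatures, and the function names are choices made here for illustration only.

```python
import math

R_AIR = 287.05        # specific gas constant of air, J/(kg K)
R_HYDROGEN = 4124.2   # specific gas constant of hydrogen, J/(kg K)


def omega_from_rpm(n_rpm: float) -> float:
    """Angular velocity in rad/s from speed in rpm."""
    return 2.0 * math.pi * n_rpm / 60.0


def k_isothermal(r_m, n_rpm, temp_k, gas_constant=R_AIR, r0_m=0.0):
    """K = P_n / P_0 for constant temperature in the rotating lead."""
    w = omega_from_rpm(n_rpm)
    return math.exp(w**2 * (r_m**2 - r0_m**2) / (2.0 * gas_constant * temp_k))


def k_constant_density(r_m, n_rpm, temp_k, gas_constant=R_AIR, r0_m=0.0):
    """K = P_n / P_0 for constant density (the density at the axis) in the lead."""
    w = omega_from_rpm(n_rpm)
    return 1.0 + w**2 * (r_m**2 - r0_m**2) / (2.0 * gas_constant * temp_k)


r = 0.283 / 2.0   # assumed radius of the measuring point, m
n = 10_000.0      # rpm
t = 288.15        # assumed lead temperature, K

k_iso = k_isothermal(r, n, t)
k_rho = k_constant_density(r, n, t)
k_iso_warm = k_isothermal(r, n, t + 10.0)
sensitivity = (k_iso - k_iso_warm) / k_iso
k_h2 = k_isothermal(r, n, t, gas_constant=R_HYDROGEN)

print(f"K isothermal (air):       {k_iso:.4f}")
print(f"K constant density (air): {k_rho:.4f}")
print(f"change of K for +10 deg:  {100.0 * sensitivity:.2f} per cent")
print(f"K isothermal (hydrogen):  {k_h2:.4f}")
```

With these inputs a 10-deg change alters K by roughly half a per cent, in line with the figure quoted in the text, and the correction for hydrogen is far smaller than for air, which is why filling the leads with different gases gives an independent check.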

CHECK TESTS

Since for the novel experimental arrangement under discussion 
a check was to be made through comparative measurements on 
the fixed grid, a preliminary investigation already had been made 
on stand I in which systematically all effects causing discrepancies 
between the fixed and moving grid had been eliminated. This 
was done by: choice of identical profiles, axial inflow through 
a straightener section with very thin walls (without inlet guide 
vanes); measurements on the pitch circle at very low rotative 
speeds; equal pitch in the fixed grid and on the pitch circle of the 
rotor grid. 

Discrepancies between the two methods of investigation were
largely eliminated in this way, namely:

The influence of centrifugal forces in the boundary layers, by
means of low speeds (200 rpm).

Pressure increase along the radius, by means of axial inflow.

The influence of wakes of the guide vanes, by means of the
thin-walled straightener section.

The leakage flow in the radial clearance, by measuring on the
pitch circle.

The result of these measurements indeed shows extensive
agreement of both curves (cf. Fig. 12).

Fig. 10 Section of Distributor Mechanism Drive Motor and Gear in Turbine Shaft (Individual Elements)

Fig. 10(a) Details of Shaft Containing Measuring Devices

Fig. 11 Influence of Correction for Centrifugal Effect (abscissa: width of the grid)

Fig. 12 Comparative Measurements With the Plane Grid (legend: pressure distribution on plane grid, fixed; pressure distribution on rotating grid with guide vanes removed, at about 200 rpm)


INFLUENCE OF PITCH RATIOS


As a part of the first phase of the program of measurements we
are conducting basic investigations concerning the influence of
different pitch ratios. Fig. 14 shows the developed blade grids
of the root, pitch, and tip circles for the blading so far
investigated (Fig. 13). These are in all cases prismatic blades
with the same blade angle $\beta_s$ in all sections. The stator
wheel was the same for all rotor rows. It consists of simple
straight sheet-metal blades.

Fig. 14 Rotating Turbine Wheels Tested With Straight

Fig. 15 Pressure Distributions, Section 4, n = 2000 rpm (Wheel 20/48, t/l = 1.25; Wheel 32/48, t/l = 0.78; Wheel 50/48, t/l = 0.50; symbols: suction surface, pressure surface)

Fig. 16 Pressure Distributions, Section 4, n = 6000 rpm (Wheel 20/48, t/l = 1.25; Wheel 32/48, t/l = 0.78; Wheel 50/48, t/l = 0.50)

Figs. 15 to 17 are presented as an example of results obtained. 
They show the pressure distributions for the same surface of 
measurement of the three different wheels, plotted against the 
nondimensional width b of the grid. These measured results are 
shown for speeds from 2000 to 10,000 rpm. 

The ordinates are the difference of the static pressure at any
given point of the profile $P_n$ and the static pressure ahead of the
grid $P_1$, divided by the stagnation pressure of relative inflow
velocity

$$q_1 = \frac{\rho}{2} w_1^2$$

This pressure ratio should have its maximum value unity at
the stagnation point. Correspondingly, at the trailing edge of the
blade there should appear the entire local pressure drop for the
section of measurements. The pressure measured by the bore
at the trailing edge of the blade, however, is a little lower than
the static pressure $P_2$ measured by a static-pressure probe slightly
behind the grid.

Fig. 13 Three Experimental Wheels Investigated
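
The ordinate defined above can be written as a small helper. This is a minimal sketch with assumed, illustrative inputs (the density, velocity, and pressure values are not from the paper); it only shows that the ratio equals unity where the full stagnation pressure of the relative inflow is recovered.

```python
def pressure_coefficient(p_n: float, p_1: float, rho: float, w_1: float) -> float:
    """(P_n - P_1) / q_1, with q_1 = rho/2 * w_1**2 the relative inflow velocity pressure."""
    q_1 = 0.5 * rho * w_1**2
    return (p_n - p_1) / q_1


# Illustrative values only (SI units): static pressure ahead of the grid,
# air density, and relative inflow velocity.
p_1 = 95_000.0   # Pa
rho = 1.1        # kg/m^3
w_1 = 80.0       # m/s

p_stagnation = p_1 + 0.5 * rho * w_1**2
cp_stagnation = pressure_coefficient(p_stagnation, p_1, rho, w_1)
cp_inlet_static = pressure_coefficient(p_1, p_1, rho, w_1)

print(f"ordinate at stagnation point:      {cp_stagnation:.3f}")
print(f"ordinate at inlet static pressure: {cp_inlet_static:.3f}")
```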

It is of interest that a comparison of the pressure distributions
shows that the full stagnation pressure $q_1$ is not reached at
higher speeds. This may be caused by radial components of velocity
resulting, for example, from centrifugal forces in the boundary
layer or else by inaccuracies of the measurements of the
static-pressure probes ahead and downstream of the rotor wheel. It
was found that the shape of the curves near the leading edge could
not be determined accurately since there had not been enough room
to provide for a sufficient number of points of measurements
around the nose. Deviations at the trailing edge may be explained
by flow around the trailing edge.

Fig. 17 Pressure Distributions, Section 4, n = 10,000 rpm (Wheel 20/48, t/l = 1.25; Wheel 32/48, t/l = 0.78; Wheel 50/48, t/l = 0.50)

Fig. 18 Pressure Distribution, Section 4 for Rotor 20/48, t/l = 1.25


Fig. 19 Pressure Distribution, Section 4 for Rotor 32/48, t/l = 0.78

Fig. 20 Pressure Distribution, Section 4 for Rotor 50/48, t/l = 0.50


Figs. 18 to 20 show the same curves plotted versus speed as 
parameter, each for constant number of blades. 

For comparison of the pressure distributions at the same relative
inlet angle $\beta_1$, there are shown in Figs. 21, 22, 23 the
pressures taken at radial Section 4, for two wheels identical except
for a different number of blades. These measurements show the
effect of closer blade spacing upon the conversion of pressure.

Figs. 24 and 25 show isometric presentations of the pressure 
distributions of the grids with 20 and 50 blades in all seven sec- 
tions of measurement located along the blade height. 

The width of the profile b was chosen as a reference line. In
this case also the static-pressure difference $P_n - P_1$ was computed
and rendered nondimensional with respect to the velocity pressure
of the relative velocity at inlet $q_1 = \tfrac{\rho}{2} w_1^2$.


The tests just described were carried out with a turbine unit of 
the following dimensions: 


Fig. 21 Comparison of Pressure Distributions, Section 4 for Equal Angle

Fig. 22 Comparison of Pressure Distributions, Section 4 for Equal Angle (for Wheel 20/48, n = 8000 rpm; for Wheel 32/48, n = 4000 rpm)


Height of blades............ $h$ = 41 mm
Pitch-circle diam........... $D_m$ = 283 mm
hence $h/D_m$ = 1:7; hub-to-tip radius ratio = 0.75
Depth of profile (chord).... $l$ = 35.6 mm
Blade angle................. $\beta_s$ = 48 deg
Blade-inlet angle........... $\beta_1$ = 84.0 deg
Blade-outlet angle.......... $\beta_2$ = 20.5 deg


The profile is a reaction profile. Measurements were obtained 
for speeds up to 12,000 rpm, which corresponds to a blade-tip 
velocity of 203 m per sec. The pressure ratios were chosen such 
that the Reynolds number was kept constant at 


$$1.5 \times 10^{5} < \mathrm{Re} = \frac{w_2\, l}{\nu} < 2 \times 10^{5}$$


where $w_2$ = relative velocity at outlet. The Mach number was
less than 0.5.
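
The quoted dimensions can be cross-checked against the other figures in the paper. The sketch below assumes that the pitch circle lies at mid-height of the blade, so that the tip diameter is the pitch-circle diameter plus the blade height; the function names are illustrative. It recomputes the tip velocity at 12,000 rpm, the hub-to-tip radius ratio, and the pitch-to-chord ratios t/l on the pitch circle for the wheels with 20, 32, and 50 blades.

```python
import math

H_BLADE = 0.041    # blade height h, m
D_PITCH = 0.283    # pitch-circle diameter D_m, m
CHORD = 0.0356     # depth of profile l, m


def tip_speed(d_pitch: float, h_blade: float, rpm: float) -> float:
    """Peripheral velocity at the blade tip; ASSUMES tip diameter = D_m + h."""
    return math.pi * (d_pitch + h_blade) * rpm / 60.0


def hub_tip_ratio(d_pitch: float, h_blade: float) -> float:
    """Hub radius over tip radius under the same mid-height assumption."""
    return (d_pitch - h_blade) / (d_pitch + h_blade)


def pitch_chord_ratio(d_pitch: float, n_blades: int, chord: float) -> float:
    """t/l on the pitch circle, with pitch t = pi * D_m / number of blades."""
    return math.pi * d_pitch / n_blades / chord


u_tip = tip_speed(D_PITCH, H_BLADE, 12_000.0)
nu = hub_tip_ratio(D_PITCH, H_BLADE)
ratios = {z: pitch_chord_ratio(D_PITCH, z, CHORD) for z in (20, 32, 50)}

print(f"tip velocity at 12,000 rpm: {u_tip:.1f} m/s")
print(f"h / D_m:                    1:{D_PITCH / H_BLADE:.1f}")
print(f"hub-to-tip radius ratio:    {nu:.3f}")
for z, t_over_l in ratios.items():
    print(f"wheel with {z} blades: t/l = {t_over_l:.3f}")
```

The results reproduce the 203 m per sec tip velocity and the t/l values of 1.25, 0.78, and 0.50 given in the figure legends.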


Fig. 23 Comparison of Pressure Distributions, Section 4 for Equal Angle (for Wheel 20/48, n = 10,000 rpm; for Wheel 50/48, n = 6000 rpm)

For Wheel 32/48, n = 10,000 rpm; for Wheel 50/48, n = 8000 rpm

Fig. 24 Correlation of Pressure Distribution Along Length of Blade, for Rotor Wheel 20/48 (t/l Pitch Circle = 1.25)

The results presented are examples illustrating the relative 
variations of the pressure distribution with blade pitch. 

Since in the establishment of these curves measured data ob- 
tained with fixed probes upstream and downstream of the rotating 
grid were used, they are of the same accuracy as any information 
obtained from the readings of probes, including those for fixed- 
blade rows. 

Experimental work is at present under way to eliminate as 
much as possible the influence of the probes for the particular 
case of our setup. The inaccuracies caused by them are not an 
inherent feature of the method for the determination of the pres- 
sure distribution on rotating grids presented here. A precise 
evaluation of all details of the data on hand should be deferred to 
the time when these special effects have been clarified and when 
more measurements are on hand for correlation. 

A closer interpretation of all details of the measurements
taken so far will not be attempted until a greater number of
measured results are available for mutual correlation.

Fig. 25 Correlation of Pressure Distribution Along Length of Blade, for Rotor Wheel 50/48 (t/l Pitch Circle = 0.5)

Fig. 26 Adjustable Probe Mounted on Rotor

Fig. 27 Over-All View of the Device for the Adjustable Rotating Probe

Fig. 28 Tip of the Adjustable Probe

Fig. 29 Results of a Wake Measurement With a Rotating Total Pressure Probe

The experimental equipment will be augmented further by a 
rotating wake-traversing probe adjustable circumferentially over 
approximately two pitch distances. It is shown in the design 
drawing, Fig. 26. An attempt will be made to measure with 
this probe the wake troughs of the total-pressure curve and to 
determine in this manner local losses and their contributing 
causes. The picture shows, on the left immediately adjacent 
to the turbine wheel, a disk from which the probe projects. 

The disk rotates with the rotor, but it can be moved periph- 
erally relative to the rotor during operation by means of a sta- 
tionary worm and a rotating planetary gear. 

Fig. 27 shows the arrangement of the probe of 1.5-mm inside 
diameter. Its peripheral position is indicated by a potentiometer. 
Fig. 28 likewise shows the probe behind the rotor blading, 
oriented in its threaded mounting to the estimated direction of 
flow at discharge (20.5 deg). To alter the radial position of the
tube the turbine must be stopped.

The measurements with the rotating total-pressure probe, re- 
cently begun, are aimed at the determination of total pressure 
losses along the span and across the pitch. An example of re- 
sults obtained is shown in Fig. 29, namely, the total pressure 
distribution above outside pressure, 16 mm axially downstream
of the rotor trailing edges, on 3 different radii, plotted against
developed circumferential distance.

The pitch of wakes equals the blade pitch. The wakes of sec- 
tions 6 and 2 are displaced peripherally with respect to each other 
because of angular deviation and because of a slight slant of the 
trailing edge. The centers of gravity are on a radius. Total 
pressure losses in sections 6 and 2 are considerably greater than 
on the pitch circle. A detailed account of this investigation of
losses will have to wait until measurements covering the entire 
radial height of blades are on hand. 

It appears unnecessary to discuss further the purposes and 
potentialities of the experimental apparatus described as these are 
self-evident. Worth-while objectives are systematic measure- 
ments on profiles of various designs which differ, e.g., primarily 
in respect to shape parameters, such as median line, camber, 
thickness, thickness distribution, and so on, and further, in respect 
to grid parameters, such as, pitch, fanning, blade orientation, 
and the like. 

This can lead to an investigation of the properties of various 
blades, including twisted blades. In conjunction with total- 
pressure measurements ahead of and downstream of the grid 
there exists further the possibility of facilitating the establish- 
ment of an analysis and itemized account of losses for blades of 
rotors with accelerating and retarding grids. 



Operating Experience and Design Features 


of Closed-Cycle Gas-Turbine Power Plants 


By CURT KELLER,¹ ZURICH, SWITZERLAND


This paper is the author’s third progress report in the 
United States on the AK closed-cycle gas turbine. The 
first comprehensive presentation in 1945 (1)² dealt with the
basic ideas and theory, the first 2000-kw experimental
plant in Zurich, and future prospects, including the use
of other gases than air, such as helium, nitrogen, or carbon
dioxide. The second paper in 1945 (2) gave more details
and design features of different components, with
special reference to machines and air heater of the first 
industrial 12,000-kw oil-fired plant for St. Denis and 
Dundee which were in the final phase of erection at that 
time. Projects for coal-fired plants also were discussed. 
This third presentation summarizes the achievements 
during the past five years. They are based on operation 
and new design experience and show the marked improve- 
ments obtained by simplifying the closed-cycle system 
components while keeping its basic properties. This de- 
velopment led the way to economical solutions in the dif- 
ferent fields of application. Work was concentrated mainly 
on power-station sets up to 15 mw and for different fuels 
as well as on ship-propulsion plants (3). The recent ac- 
celerated development of atomic power brings a new incen- 
tive to the closed-cycle gas turbine using gases other than 
air as the working medium and reactor coolant. 


GENERAL EXPERIENCE WITH PILOT PLANTS 


BY THE end of 1955, 14 closed-cycle gas-turbine power plants
ranging from 700 to 12,000 kw were in operation or being
built by Escher Wyss and its different licensees (4). Among
them are one 6600 and two 10/12,000-kw new coal-fired sets.
The outstanding event of the past year was the successful start
of the world’s first industrial pulverized-coal-fired plant.
Generally speaking, we can state that during the entire de- 
velopment period no basic change of system layout proved to be 
necessary. Working pressures and temperatures as well as regu- 
lating means are still the same as foreseen in our first studies 15 
years ago. However, practical realization of the first pioneer 
plants brought up a number of engineering problems and 
difficulties which, while having nothing to do whatsoever with the 
principle itself, enforced long delays on us. Strangely enough, 
practically no difficulties arose in any high-temperature-region 
components of the plants such as air heater and turbines. Not a 
single tube in any air heater has been damaged in normal service 
nor any high-temperature blading or shaft. Our difficulties
arose from so-called normal components such as auxiliaries,
bearings, shaft vibrations, piping-expansion problems, compressor
blades, electrical equipment, and so on. Such matters are
treated in the following as they may show the difficulties which
had to be overcome in bringing the first plants into reliable
industrial service.

1 Director of Research and Development, Escher Wyss Ltd. Mem.
ASME.

2 Numbers in parentheses refer to the Bibliography at the end of
the paper.

Presented at the Gas Turbine Power Division Conference,
Washington, D. C., April 16-18, 1956, of The American Society
of Mechanical Engineers.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, March 29,
1956. Paper No. 56-GTP-15.

The first two plants, of 2000 kw at the Escher Wyss works and 
12,000 kw in St. Denis for the EDF, were both designed, manu- 
factured, and mounted entirely by Escher Wyss during war and 
after-war times. Impossibility of free choice of material and 
auxiliary equipment at that time was naturally a great draw- 
back. Knowledge for many essential components, for instance on 
heat resisting steels, high efficiency axial compressor blading, and 
extended heat-transfer surface, was not yet very advanced. 

We managed somehow to get through the war with all the tests 
and the pilot plant proved to fulfil all the expectations. For long 
periods, when electricity restrictions existed in Switzerland, the 
plant was the only energy source for the works. An over-all ef- 
ficiency of 32 per cent at full load and the very flat efficiency curve 
over a big load range were the most striking things for the engi- 
neer. A total of 6000 industrial service hours with this test plant 
were run without significant trouble. We want to point out es- 
pecially that the air heater, which was of quite advanced design, 
stood all services without any trouble. Not one single tube 
failed and this is true for all air heaters built up to now. This 
component of the closed-cycle plant which was often regarded as 
most questionable, turned out to be most reliable. 


The regulating procedure of closed-cycle plants is quite simple:
pressure-level variation is combined with by-passing the
compressor. No valve or regulating device is located at the machine
itself. Inlet, outlet, and bypass valves for the working medium 
are in the cold region (Fig. 1). This is an important advantage 
from the operating point of view. The set answers very quickly 
to load alterations. This fact was apparent in the operation of the 
first 2000-kw test plant and was described in detail in an earlier 
paper. Even when disconnected from the general grid, the speed 
of the small 2000-kw set did not vary more than 2 per cent when 
load was lowered quickly by about 500 kw. A quick shutdown 
produced a speed rise of less than 4 per cent. 

Fig. 2 shows the corresponding behavior of the 12,000-kw St. 
Denis plant. Even at very abrupt load changes, the working air 
temperature at turbine inlets varies very slowly, owing to the
favorable heat-storage effect of the air-heater tubes. Many
shutdown tests from full load showed that even in this complex plant
speed increase is less than 5 per cent. This low value is due to 
braking effect of the compressor being on the same shaft as the 
turbine and generator. The modern one-shaft sets are even 
easier to control. 

For St. Denis a top pressure of 50 atm (700 psia) and a pressure 
ratio of 10 with intermediate heating cycle were chosen. The
reason for this choice was mainly the fear that with lower pressure 
ratios the recuperator and cooler dimensions would become too 
great with high mass flow. At that time no appropriate extended 
surface was available and heat-exchanging apparatus had to be 
designed with straight tubes. Furthermore, at that period, air- 
heater design problems were overestimated because of unknown 
nonsymmetrical radiation effects on tube walls at high tempera- 
tures. Therefore the St. Denis air heater was built as a pure 
convection-type apparatus. In order to reduce the surface, 
supercharged combustion was chosen which entailed greater 
complication because of the necessary charging set. 




TRANSACTIONS OF THE ASME 


Fig. 1 Combined Bypass and Inlet Valve for Closed-Cycle Plant


(Whole regulating device is in covered box near engineer.) 


Fig. 2 Section of Load and Temperature Diagram of 12,000-Kw Plant During Regulating Tests




APRIL, 1957 


In the final uninterrupted continuous run of 700 hr in 1955, 
most valuable measurements and tests were made with runs up 
to full load and top temperature of 675 C (1250 F). An inspection 
of the machine and air-heater system after more than 3000 hr of 
running time showed clean machines, recuperators, and piping. 
Also the outside surfaces of the air-heater tubes were in very good 
condition and no corrosion was experienced. 

In the following we will mention some of the annoying difficulties
which occurred at St. Denis. None of them was related in any way to
the air-heater performance or to the high-temperature parts.
The plant was shut down recently after having proved to be 
quite satisfactory in mechanical behavior. At present some 
machines are undergoing general alterations. Based on new ex- 
perience, some of the old blading for compressors and turbines, 
which were damaged by earlier accidents, will be replaced by im- 
proved blades. 

In order to illustrate our troubles a list of some events which 
occurred at St. Denis since starting up in 1952 follows: 

Initially incorrect insulation of a double-wall high-temperature
inlet pipe to turbine caused some turbine vibrations because cas- 
ings were displaced by unequal temperature elongations. 

After a 200-hr run in July, 1952, a slight vibration of the high- 
pressure (H-P) radial compressor was noted. Shortly afterwards 
the H-P turbine set speeded up and consequently the quick- 
closing device of this group acted automatically and the whole 
plant was shut down as foreseen. An examination of the H-P 
compressor rotor showed that the blades of one runner wheel had 
been broken away. The detailed inspection proved that some 
stationary diffuser blades from the preceding stage had been 
welded unsatisfactorily. Because of the stream forces acting on 
these deflector blades during normal operation the welds became 
loosened and pieces of the blades were thrown into the following 
runner wheel, causing the damage. 

The charging and leakage compressors of the unlubricated pis- 
ton type were unreliable at first and had to be rebuilt by the manu- 
facturer. Now we use rotary Lysholm-type compressors. 

At the start the electrical equipment of the plant caused some 
short circuits in the auxiliary motors, and various faults in the 
electrical relays hindered proper functioning of the safety devices. 
Hence, at a shutdown test, the electrically driven auxiliary oil 
pump did not function and some bearings of the machines were 
damaged. 

The main cooling-water electric motor once caught fire from 
some unknown cause during normal service and required a long 
time for repair. 

The cooling-water conditions at St. Denis were unfavorable 
during summer months as temperatures went up unusually high 
(to 90 F) and the water was very dirty. The coolers required 
frequent cleaning. 

At first, high-pressure and low-pressure sets were running quite 
satisfactorily, but after various necessary dismantlings of the 
machines, the running behavior got gradually rougher after re- 
aligning the shafts. It took a long time to find out that both 
groups were running too close to the critical speed, because the 
theoretical calculation did not indicate this as the cause. Model 
tests showed that assumptions of the calculation were incorrect 
and that the critical speed was too close to the running speed. 
When the shaft and bearings were not aligned very accurately,
vibrations were suppressed by the additional bending, but the more ac-
curate the alignment, the worse the running became. Fortu-
nately, by removing only one bearing in each L-P and H-P set,
the critical speed of the shaft could be entirely corrected and since 
then the running behavior is perfect. 

Measurements show that the character of the flat efficiency 
curve corresponds to the expectations (Fig. 3) and that the out- 
put is proportional to the pressure level. 


Fig. 3 Efficiency and Output of First 12,000-Kw Plant at
Different Pressure Levels (Loads)
(Curves of output and load plotted against make-up pressure on L-P side.)


The first closed-cycle gas-turbine power plant in Scotland, at
Dundee, built by John Brown & Co. Ltd., does not differ greatly
from the St. Denis plant in its components. It is also a complex 
two-shaft group. The air heater with supercharged combustion 
chamber is built for oil firing but is much smaller than the cor- 
responding St. Denis apparatus. A cross-flow combustion-type 
air heater is used. This plant was started in the Fall of 1954, 
without too great difficulty. However, after less than 100 hr of 
service the alternator caught fire and was heavily damaged. The 
reasons for this accident are not clear. This failure delayed opera- 
tion of the plant for a year. Later a blade damage in one of the 
axial compressors occurred, similar to a corresponding failure in 
the St. Denis plant. As the forces in compressors running at high 
density are greater than in normal atmospheric or low-pressure 
compressors, improved structural design of stator and runner- 
blade roots is necessary. Work of altering the Dundee plant is 
still under way. 

In addition to the 12,000-kw Dundee plant, John Brown Co. 
has pioneered and completed successful development work and 
tests with coal and peat-fired air heaters (Fig. 4). This work is 
sponsored by the North Scotland Hydro-Electric Board as well 
as by the British Coal Board. A 2000-kw peat-fired industrial
plant incorporating this air heater and a "Tuco" set similar to
Fig. 7 is under erection in the North of Scotland, as well as a
2000-kw set with a coal-fired air heater for Scotland.

John Brown uses a vortex slag-tap combustion chamber for its 
solid-fuel-fired heaters while in the Continental development nor- 
mal conventional pulverized-coal burners with granulating com- 
bustion chambers are used. Furthermore, John Brown has built 
the first 700-kw closed-cycle plant using waste gases from a gas 
works. The start of operation of this plant in Coventry was de- 
layed until the middle of 1955, owing to building restrictions in 
England. This plant is undergoing extensive service tests now. 

It is believed that the waste-heat recovery from all sorts of 
chemical plants offers a promising field of application even for 
smaller outputs, because efficiencies of gas turbines can be kept 
higher than with the corresponding small steam turbines, and 
high temperatures can be used without the complications involved 


629 
1000KW | 
10 
AW | 
| | 
JZ, 640°C | 
| 
% 
620°C 
10 
3 — 09 
2 ——- —08 
0 a6 


TRANSACTIONS OF THE ASME 


Fig. 4(b) Brown's System or Expermmentat Arr-Heater ror Fue (1950) 


630 
| 
Fic. 4(a) Joun Brown's Test PLant ror Peat anp Coat-Firep Air Heaters (1955) 
a, 


APRIL, 1957 


nee a4 


by 


L=9,5m 32ft 


L= 3m 10ft 


with steam turbines. Power production per calorie of waste heat 
is high. If necessary, steam production for process steam can be 
combined with the closed-cycle plant. 


ADVANCED MACHINERY DESIGN


As time went on, it became more and more obvious that the 
earlier complex plant design with different compressors and tur- 
bines or 2-shaft arrangement was too complicated as well as too 
costly and uneconomical. It was clear also that the first big 
power plant of 10/12,000 kw, designed and built for the EDF 
(Electricité de France) in Paris around 1946, was developed along 
too academic lines. 

The progress in simplified design between the first 2000-kw test 
plant in 1942 and today’s design 
is best demonstrated by Fig. 5, 
giving both machine sets to the 
same scale. The old turbine 
which ran so satisfactorily is 
the first gas turbine working 
with 700 C (1300 F) in indus- 
trial service. It now occupies a 
place of honor at the permanent 
exhibit of pioneer machines at 
the Deutsches Museum in
Munich. 

The new Tuco machine proved 
to be very successful from the 
start. Only 4 hr after the first 
run, in 1955, we could connect 
the set to the grid and pick up
load. After 500 hr of thorough 
tests this first machine was 
shipped to Japan. It is part of 
the first closed-cycle 2000-kw 
oil-fired plant in that country 
by Fuji-Denki which started 
operation in December, 1955. 
Fig. 6 shows this plant on site. 

Fig. 5 Comparison of 2000-Kw Closed-Cycle Machinery of
Test Plant (1939) With Today's Machine Set of the Same
(Weight 25 t)


We have concentrated our later design and development
studies on three standard sets (2000-3000, 5000-6000, and 10,000-
15,000 kw). The two newly ordered 10/12,000-kw sets are being
built as Tuco sets, where compressor and turbine are combined 
on one rotor, similar to Fig. 7(a). Leakage losses are reduced be- 
cause for this arrangement of the compressor and turbine no 
high-pressure gland is necessary. All these machines can be used 
for oil, gas, or coal-fired air heaters. 

In the near future development of additional standard sets above 
12,000 kw is foreseen. As mass flow of working fluid is becoming 
larger, the best solution for the machine will then be axial-flow type 
machinery. Fig. 8 shows a new compact machine design of 15 
mw suitable also for ship propulsion. Here again no high-pressure 
gland has to be provided, only the low-pressure sides of the 
machines have to be tight against the atmospheric surroundings. 
The design pressure for this set is 35 atm. There is no reason why 
the pressure level should not be increased. As the diameters of 
the machines are small, the pressure of 70 atm would result in a 
useful output of the machine as per Fig. 8 of approximately 30 
mw. The free choice of the pressure level is one of the most im- 
portant design advantages offered by the closed-cycle plant in 
order to cut dimensions. 
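
As a rough illustration of how the pressure level sets the output of a machine of given size, the scaling quoted above can be sketched in a few lines. The sketch assumes that useful output is simply proportional to design pressure at unchanged dimensions, speeds, and temperatures; the function name and this linear model are illustrative assumptions and not a design method.

```python
def scaled_output_mw(base_output_mw: float,
                     base_pressure_atm: float,
                     new_pressure_atm: float) -> float:
    """Estimate output at a new pressure level.

    Assumption (for illustration only): output is proportional to the
    design pressure when machine dimensions and temperatures are fixed.
    """
    if base_pressure_atm <= 0 or new_pressure_atm <= 0:
        raise ValueError("pressures must be positive")
    return base_output_mw * new_pressure_atm / base_pressure_atm


# Figures quoted in the text: a 15-mw set designed for 35 atm,
# with the pressure level raised to 70 atm.
print(scaled_output_mw(15.0, 35.0, 70.0))  # 30.0
```

Under the same assumption a 50-mw set at 35 atm would give about 100 mw at 70 atm, which agrees with the figures quoted for the larger study.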


Fig. 6 First Japanese Closed-Cycle Gas Turbine on Test Bed


Fig. 7 Cross Section of New Tuco Machinery Set. Turbine and Compressor on Same Shaft


Fig. 7(a) Rotor of 2000-Kw Tuco Machine


No obstacle is seen in principle, therefore, from the design point
of view in building closed-cycle gas turbines for 50 to 100 mw.
The rotor dimensions of a 50-mw set with 35 atm working pressure
(which can deliver 100,000 kw when the design pressure is
boosted to 70 atm) are shown in Fig. 9.


AIR-HEATER PROBLEMS


The first air-heater tubes (Fig. 10) were fabricated from 25/20
chrome-nickel heat-resisting (Thermax 11) steel strips which were
bent and welded longitudinally. At that time (1937) no reliable
figures on long-time creep tests were available so we conducted 
an investigation in our metallurgical laboratories. The most 
interesting creep tests were at 720 C under stresses and tempera-
tures which occur in air-heater tubes (1400 F, 500 psia, Fig. 11).
The results of a temporary involuntary overheating are shown, for
instance, in these curves. An unforeseen temperature rise of 100 C in
the test apparatus to 820 C for 4 hr, when the tests already had
been under way for 15 days, did no harm to the test specimens. It
is evident that this high overload only produces a more rapid
elongation. When normal test temperature is resumed, creeping
continues normally on a somewhat higher level. No crack occurs,
but rupture time is slightly reduced. This test shows that safety
of air-heater surfaces is quite good even at an unexpectedly high
degree of overheating because tube stresses are very low compared
with blade stresses in open-cycle turbines. Long-time stress-
rupture tests for different materials at the Escher Wyss labora-
tories run up to 45,000 hours. About 450 test specimens are under-
going tests. Short-time rupture stress is about 10 times higher
than normal working stress and can be withstood for many hours.


Fig. 8 Cross Section of Advanced Design 15-20-Mw Closed-Cycle Machine Group.
Built-In Intercooler


Fig. 9 Study for 50-Mw Machine Group Demonstrates Small Rotor Dimensions
1 From precooler, LP compressor inlet.
2 From intercooler I, IP compressor inlet.
3 From intercooler II, HP compressor inlet.


Another series of material tests is being conducted with real 
tube samples under internal pressure. With such tests we have 
now reached 30,000 hours at 750 C and 50 atm (700 psia) inside 
pressure, corresponding to full-load service. G 18 B tubes for in- 
stance showed a growth of only 2.4 per cent in diameter after 28,- 
000 hours; type 316 tubes 2.8 per cent in 20,000 hours. Rupture 
does not occur below elongations of 15 to 20 per cent. Wall stress 
is approximately 2 kg/mm² (2800 psi). Tubes which become over-
heated do not crack suddenly. As Fig. 12 shows, a strong
bulging action occurs many hours before cracking, which shows 
that the danger point is approaching. No explosion occurs. 




Fig. 10 First Escher Wyss Air Heater of 1939 Test Plant
Showing External Convection Tube Rows Surrounding Inter-
mediate Combustion-Chamber Wall
[Radiation tube rows are inside intermediate wall. Combustion room diam
120 cm (4 ft), height 7 m (23 ft), specific load of combustion room 830,000
kcal/m³ (93,000 Btu/cu ft).]


Contrary to steam-boiler practice we can use small-diameter
thin-wall tubes to build up the heating surface (approximately 1
in. diam, 0.1 in. wall). Drums are not necessary in modern air-
heater design. Small tubes are welded in individual conical
headers from which the working gas is led in a small number of 
collecting tubes to the final collectors (Figs. 13, 14). Each tube 
bundle is accessible and can be removed easily. The maximum 
tube-wall temperature never exceeds the working temperature 
of the inside gas by more than 50 C (90 F), due to the high 
heat-transfer coefficient inside (250 to 500 Btu/sq ft hr deg F).
This has been checked with thermocouples in different air
heaters as in St. Denis and Dundee. 
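
The wall stress of about 2 kg/mm² (2800 psi) quoted for the tube samples can be checked against these tube dimensions with the elementary thin-wall formula. The sketch below is only an order-of-magnitude check under stated assumptions: a tube of about 1 in. outside diameter and 0.1 in. wall, an internal pressure of 700 psi taken as the full pressure difference across the wall, and the hoop stress evaluated as p·r/t with the inside radius. The function name and the conversion constant are introduced here for illustration.

```python
PSI_PER_KG_MM2 = 1422.33  # 1 kg/mm^2 (kilogram-force per mm^2) in psi


def hoop_stress_psi(pressure_psi: float,
                    outside_diam_in: float,
                    wall_in: float) -> float:
    """Thin-wall hoop stress p * r / t, using the inside radius.

    Assumption: the pressure acts as a pure internal overpressure and
    the wall is thin enough for the elementary formula to apply.
    """
    if wall_in <= 0 or outside_diam_in <= 2 * wall_in:
        raise ValueError("wall must be positive and less than the tube radius")
    inside_radius_in = outside_diam_in / 2.0 - wall_in
    return pressure_psi * inside_radius_in / wall_in


stress_psi = hoop_stress_psi(700.0, 1.0, 0.1)
stress_kg_mm2 = stress_psi / PSI_PER_KG_MM2
print(round(stress_psi), round(stress_kg_mm2, 2))
```

With these assumed dimensions the result is close to 2800 psi, that is, a little under 2 kg/mm², in line with the value given for the tube tests.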

Thin-walled air-heater tubes of ferritic and austenitic steel 
have been welded together and this connection has proved to be 
reliable, even under 1000 times repeated thermal-shock tests be- 
tween 300 and 600 C (Fig. 15). 

The question of chemical attack by combustion gases has been 
given our greatest consideration. We began our first tests on 
vanadium attack at the close of 1945, for instance, and pointed 
out the influence of additives for high-vanadium-containing fuel 
oils in basic patents. One recommends raising the ash-melting
point, the second the dilution effect, and another evaporation of
the vanadium oxide. Also sulphur attack is being studied care-
fully and has been found not to be dangerous for alloyed steel up to
700-750 C. 

We believe that corrosion is not a chemical problem alone, but 
also can be combatted by suitable mechanical design. Therefore, 
in all our air heaters a radiation section is formed by straight 
vertical tubes, forming what is practically a cylindrical combus- 
tion chamber. Usually burners are at the top. This prevents ash 
deposits being built up on high-temperature tubes. Only when 
gases have cooled down do they reach cross-section tubes in the 
convection part of the air heater. The air heater approaches 
tube-still design philosophy more closely than it does that of the 
steam boiler, where similar conditions exist concerning tube 
stresses and temperatures. 

The latest oil-fired air-heater design by Escher Wyss for a sta- 
tionary power plant, where low space requirements are not of 
concern, has been built by our Japanese licensees (Figs. 16, 17). 
This design offers very good accessibility to the entire tube surface 
and should be suitable also for low-grade fuel. For marine plants 
much more concentrated designs with higher combustion chamber 
and specific tube-load figures can be accepted. For naval air 
heaters we have designs which require only 0.2 cu ft per hp. Such 
an air heater for a 2 × 10,000-shp plant is being manufactured in
Japan by one of our Japanese licensees (Fig. 18). 


COAL-BURNING PLANTS


Since the beginning of the development Escher Wyss has con- 
centrated much of its efforts on developing the coal-burning gas 
turbine. In collaboration with our licensees "Gutehoffnungs-
hütte" (GHH) and "Kohlenscheidungs-Gesellschaft" (KSG), in
Germany, we have designed and built the first 2000-kw plant 
which went into operation in Ravensburg, Germany, in De- 
cember, 1955 (Figs. 19, 20). This unit not only furnishes the en- 
tire electric power for an industrial plant of 1500 workers, but 
also delivers the necessary heating for the whole establishment 
and the workshops. Combination of power and heat production 
in the closed-cycle plant is very promising. Compared with the
corresponding steam plant, the electric output per available heat- 
ing unit is higher. Furthermore, power production and heat 
production are independent of each other. Warm cooling water 
of the intermediate compressor coolers as well as the precooler of 
the cycle is available without affecting the power-production cy- 
cle itself at elevated temperatures of 80-110 C (175-215 F). 
The over-all efficiency in such a combination is about 60 per cent;
i.e., about 30 per cent of the fuel energy is converted into elec-
tric power and an additional 30 per cent of the fuel energy is trans-
ferred into useful process heat. An approximate scheme of the
Ravensburg plant is given in Fig. 21. The pulverized-coal-fired 
air heater stands as an outdoor unit near the powerhouse (Fig. 20) 
as no danger of freezing exists in the air heaters. 
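
The energy balance of such a combined plant can be illustrated with a short calculation. The sketch below assumes the round figures given above (about 30 per cent of the fuel energy as electric power and about 30 per cent as useful heat) and treats them as fixed fractions independent of load; the function name and the example output of 2000 kw are chosen here for illustration only.

```python
def combined_plant_balance(electric_kw: float,
                           electric_fraction: float = 0.30,
                           heat_fraction: float = 0.30):
    """Return (fuel input, useful heat, over-all efficiency).

    Assumption: electric power and useful heat are fixed fractions of
    the fuel energy, as in the round figures quoted in the text.
    """
    if not 0.0 < electric_fraction <= 1.0:
        raise ValueError("electric_fraction must lie in (0, 1]")
    if not 0.0 <= heat_fraction <= 1.0 - electric_fraction:
        raise ValueError("fractions must not add up to more than 1")
    fuel_kw = electric_kw / electric_fraction
    heat_kw = fuel_kw * heat_fraction
    overall = electric_fraction + heat_fraction
    return fuel_kw, heat_kw, overall


fuel_kw, heat_kw, overall = combined_plant_balance(2000.0)
print(round(fuel_kw), round(heat_kw), round(overall, 2))
```

For a 2000-kw electric output this gives a fuel input of roughly 6700 kw and about 2000 kw of useful heat, i.e., the 60 per cent over-all figure.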

There is a great trend in Europe to combine power and heat 
production in thermal plants as a means of improving fuel 
economy. In many parts of Europe electric-energy costs are 
high (2-2½ cents/kwhr) even when produced by large steam
units. This offers advantages for high-efficiency gas turbines and
favors decentralization of power production. The smaller self-
contained power units from 5-20 mw are of interest to many
countries, especially Germany where both transmission of elec-
tricity and coal-transportation costs are high. Two more coal-
fired plants for isolated town works, one of 12,000 kw and one of
6600 kw, combined with remote heating for the surrounding
buildings, have been ordered recently and will come into opera-
tion in 1958 as a result of the successful starting operation of the
pilot plant in Ravensburg.


Fig. 11 Creep Test 1937 Heat-Resisting Material (25/20 Steel) (T = 720 C)
(Axes: specific elongation against test duration in days.)


Fig. 12 Creep Tests, Samples of Air-Heater Tubes Under Internal Pressure


Fig. 13 Individual Tubes of Radiation Section Are Contca,

Fig. 14 Detail of Coal-Fired Air-Heater Tube System
(Radiation part in foreground. Convection part in background.)


In Fig. 22 the layout of a 6600-kw closed-cycle plant is given.
It should be noted that the first 2000-kw coal-fired air heater was
based on very conservative figures in order to gather experience.
Compared with the corresponding steam boilers, the specific load
of the combustion chamber was very low (125 × 10³ kcal/m³ = 14
× 10³ Btu/cu ft). No difficulty was encountered with the coal-
fired air heater. The height of the combustion chamber is dictated
by the manner and timing of feeding the coal as required for perfect
combustion, and depends somewhat on coal quality. Air heaters
also will burn poor-quality coal with as much as 35 per cent ash
as well as brown coal. Burners are placed at the top of the
cylindrical combustion chamber. We realize that the first 2000-
kw air heater is the lower limit of economical and technical appli-
cation. Fig. 23 shows that the air heater now being built, having
more than 3 times greater output than the Ravensburg heater, has
practically the same dimensions and a simplified tube arrange- 
ment. 
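
The specific combustion-chamber loads quoted in metric and English units can be cross-checked with a simple conversion. The sketch below assumes that the metric figures are in kcal per cubic metre (the unit is partly illegible in the source) and uses rounded conversion constants; the function name is hypothetical.

```python
BTU_PER_KCAL = 3.9683     # approximate: 1 kcal expressed in Btu
CU_FT_PER_M3 = 35.3147    # approximate: 1 cubic metre in cubic feet


def kcal_per_m3_to_btu_per_cu_ft(load_kcal_per_m3: float) -> float:
    """Convert a volumetric heat load from kcal/m^3 to Btu/cu ft."""
    return load_kcal_per_m3 * BTU_PER_KCAL / CU_FT_PER_M3


# Coal-fired heater of the first 2000-kw plant (figure from the text,
# unit assumed to be kcal/m^3).
coal_heater = kcal_per_m3_to_btu_per_cu_ft(125e3)

# Combustion room of the 1939 test-plant heater (figure from the
# caption of Fig. 10).
first_heater = kcal_per_m3_to_btu_per_cu_ft(830e3)

print(round(coal_heater, -2), round(first_heater, -2))
```

Under this assumption the two results come out near 14 × 10³ and 93 × 10³ Btu/cu ft, respectively, which matches the English-unit values given alongside the metric ones.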

It should be noted that when steam is both the working and 
heating medium, the steam boiler must be designed for higher 
steam production than would be necessary for power production 
alone. Therefore, steam boilers for combined power and heat
production get larger and more costly. The closed-cycle system
provides hot water as waste heat from the cycle. Hence, the
air heater for a power production plant or combined power-heat 
production plant, is the same. This fact helps to make such 
closed-cycle plants in this field economical. Modern closed-cycle 
plants using only moderate pressures from about 25-50 atm (350- 
700 psia) top pressure are much simpler in layout and operation 
than modern and complex steam cycles with high pressures and 
temperatures which are profitable only in very large units. 


PLANTS 


As the specific fuel consumption with the closed-cycle ship 
plant is practically constant over a wide range of load, the closed- 
cycle gas turbine offers special advantages compared with other 
machinery for naval vessels. Special features of the ship-propul- 
sion closed-cycle plant have been discussed in detail in a recent 
paper by Keller and Spillmann (7). 

The ideal propeller to go with gas turbines is the automatically 
controlled variable-pitch propeller. Recent experiences of 
Escher Wyss show that such propellers can be built without dif- 
ficulties up to 10,000-15,000 shp, based on the experience with 
Kaplan turbines. In the past two years an increasing number of 
such propellers was built. Prejudice seems to have been over- 
come by successful demonstration of a great number of recent in- 
stallations, especially in France and Germany. We feel that 
variable-pitch propellers up to 20,000 shp can be built safely. 

Another solution for the propeller drive of a gas turbine is the 
radial-inflow turbine. By variable guide vanes the sense of rota- 
tion of the fixed propeller can be changed. Owing to the elevated 
pressure level of the closed cycle, the dimensions of the power tur- 
bine can be kept small (Fig. 24). One turbine resembling the 
Francis-type water turbine can produce 10,000 shp at 10⁴ rpm with
not more than 710 mm diam. For ship plants the closed-cycle 
turbine is divided into a compressor-turbine group, similar to the
Tuco design, and an additional independent propeller turbine.
This second turbine receives the exhaust gas from the H-P
compressor-turbine at a relatively low pressure and temperature,
so that no special design conditions exist.


Fig. 15 Austenitic/Ferritic Tube Sections Under
Thermal Shock Tests


ATOMIC POWER PLANTS


The new simplified machine designs have been developed with 
a view to using the same basic elements both for fossil-fuel-fired 
plants and future atomic-energy installations. At the beginning of 
our development work we pointed out the unique property of the 
closed-cycle system to use different gaseous working media. In 
1945 a scheme for using this cycle in combination with a gas- 
cooled reactor was studied, but time was not ripe then for that 
proposal, owing to the lack of reactor technology for high-tem- 
perature reactors (Fig. 25). Several patents were applied for, 
using helium, nitrogen, or helium-CO₂ mixtures and proposing
different working schemes. Escher Wyss is pleased to learn that 
recently these proposals are being reconsidered. The basic ideas 
were first published in the United States by R. T. Sawyer (8), but 
did not become widely known. An old internal Escher Wyss re- 
port written in 1945 is becoming up to date again now, and reads 
as follows: 

The physical characteristics, for example of helium or mixtures 
with carbon dioxide are such, with regard to the heat transmis- 
sion, that for the same losses of pressure and the same absolute 
pressures a heat transmission results which is about three times 
greater than is the case of air. 

Thus, for otherwise similar conditions, the heating surfaces in 
a helium heater could again be considerably reduced compared to 
the air heater. However, if the same peripheral speeds are to be 
employed, light gases call for the adoption of more stages in the 
machines. In this connection it is worthy of note that helium, for
instance, has a sonic velocity which is approximately three times
greater than that of air at the same temperatures. In principle, 
this would permit of the admissible circumferential speeds of 
turbomachines being increased, since they are dependent by no 
small degree on the closeness of the approach to the velocity of
sound. 

Moreover, the possible increase of the peripheral speeds in the 
event of helium being adopted, again leads to a reduction in the 
number of stages, so that in this connection also the construc- 
tional prerequisites are not unfavorable, which incidentally al- 
ready has been shown by corresponding investigation carried out 
by Professor Ackeret. 
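
The statement in the report that helium has a sonic velocity about three times that of air can be verified with the ideal-gas relation a = sqrt(γRT/M). The sketch below assumes ideal-gas behavior with constant specific-heat ratios (1.40 for air, 5/3 for helium) and standard molar masses; the function name and the chosen temperature are illustrative only, and the ratio does not depend on the temperature as long as both gases are compared at the same one.

```python
import math

R_UNIVERSAL = 8.314462618  # universal gas constant, J/(mol K)


def sonic_velocity(gamma: float,
                   molar_mass_kg_per_mol: float,
                   temperature_k: float) -> float:
    """Speed of sound in an ideal gas, a = sqrt(gamma * R * T / M)."""
    return math.sqrt(gamma * R_UNIVERSAL * temperature_k
                     / molar_mass_kg_per_mol)


T = 300.0  # K, same temperature for both gases (assumed for the example)
a_air = sonic_velocity(1.40, 0.028964, T)          # dry air
a_helium = sonic_velocity(5.0 / 3.0, 0.0040026, T)  # helium

print(round(a_air), round(a_helium), round(a_helium / a_air, 2))
```

The ratio comes out a little below 3, which supports the approximate factor of three stated in the report.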

If one compares the project of a gas-turbine nuclear power plant
which was designed by Ackeret in November, 1945 (Fig. 26),
with the modern aspects of such plants, the preview of future
realization possibilities even for reactor dimensions and control
details was not bad. We believe this document to be of histori-
cal interest to engineers.


Fig. 16 Cross Section of
Fired Air-Heater
Combustion-air preheater
Combustion air in
Combustion air out
Burners in top of air heater
Combustion chamber and radiant section


Fig. 17 Japanese-Built Air Heater for a 2000-Kw Plant, See
Fig. 6


Fig. 19 Machine Set of First Coal-Fired 2300-Kw Plant
[Left to right: Generator with starting motor, Tuco set (turbine-
compressor set), precooler used as water heater for central heating.]

Today’s development in elevated-temperature gas-cycle reac- 
tors bears out the feasibility of such ideas by the impressive work 
done in the AEC laboratories and private industry. As revealed
by the recent ASME Nuclear Gas Turbine Symposium, Wash- 
ington, D. C., December, 1955, (a) high-temperature fuel elements 
will be available and, (b) it is possible to maintain high heat flux 
in small-sized, low-pressure-drop reactors when highly pres- 
surized gas is used. 

We realize that a number of engineering problems have to be
solved before building reliable and safe plants. The require-
ment of tightness for such a plant necessitates special construc-
tion, but such matters are involved with all other systems too.
Helium or helium mixtures with other gases might be the ulti-
mate best solution for the working medium, but recent studies of
American physicists and engineers show that pure nitrogen is
also attractive from the economical and the machine-design
point of view. Machines and heat-exchangers are practically
the same for compressed air or compressed nitrogen. Therefore,
all air-machine experience can be applied directly to nuclear
power machinery. Design studies for normal and large nu-
clear power plants up to 60 mw have been published recently by
the AEC (10). Helium as the working medium requires more
complex and multistage machinery. We are at present engaged in
studying new design prospects for plants using gases other than
air. Gas mixtures promise favorable solutions to machinery
simplification because they lead to a reduction of turbomachine
stages.


Fig. 18 Cross Section Naval Air
Heater for 10,000-Shp Plant
Air inlet from heat exchanger 400 C
Primary convection section
Combustion chamber
Secondary convection section
Air outlet to H-P turbine 675 C
Combustion-air preheater
Oil inlet for burner


Fig. 20 View of First Industrial Coal-Fired Gas-Turbine
Plant, 2300 Kw. Started Operation in 1956
(Left side, powerhouse; right side, air heater.)


Fig. 22 Layout of Modern 6600-Kw Closed-Cycle With Pulverized-
Coal-Firing
Tuco set
Precooler
Generator
Air-heater radiation section
Air-heater convection section
Coal bunker
Coal mill
Burner
Combustion air preheater


Fig. 21 Scheme of Combined Closed-Cycle Plant for Power
Production and Central Heating


American Turbine Corporation (4) and Escher Wyss have been 
developing designs particularly adapted to United States require- 
ments. Both fossil-fuel-fired and nuclear-reactor heat sources 
have been considered but the simplicity and the high efficiency
of the closed-cycle power-plant system make this plant es- 
pecially desirable for the nuclear application. Accordingly, a 
10 to 15-mw set has been designed which meets the require- 
ments for a wide range of possible applications. The cross sec- 
tion of a turbomachinery set is shown in Fig. 27. Axial ma- 
chinery was chosen for our plants above 10 mw because of its 
compactness and high efficiency. Pure nitrogen is used as a 
working fluid when employed in a nuclear plant; obviously air 
could be used when heat is supplied by fossil fuels. 

One proposal employs a single, closed, nitrogen circuit, in
which the working fluid passes directly through both the
graphite-moderated reactor and the gas turbine. A chemically
fired air heater is to be installed for initial operation of the tur-
bine before the reactor is placed in service. This heater will
serve as a standby source of heat for the nuclear plant during
periods of reactor experimentation or maintenance. Machinery
(Fig. 27) is similar to that of Fig. 8 but utilizes a separate inter-
cooler rather than an integral one. A typical layout incorporat-
ing this machinery is shown in Fig. 28. The simplicity and
compact design of this 15-mw plant having over 30 per cent
efficiency (at generator terminals) should be noted. The useful
output and the work of compression are produced in a single tur-
bine of 2 ft 6 in. OD, where nitrogen expands from 503 psia
(1290 F) to 130 psia (325 F).


Fig. 23 Comparison of 2000 and 6600-Kw Coal-Fired Air Heaters


Fig. 24 Inflow Turbine for Elevated-Pressure Levels
as Used in Closed-Cycle Plant
(Panel labels: closed-cycle air turbine, 20,000 shp; open-cycle gas turbine; naval plant.)


Fig. 25 Scheme of a Gas-Cycle Reactor


Fig. 26 Proposal of Nuclear Plant 20 Mw. Reactor, Gas Turbine in One Loop


Fig. 27 Cross Section of 15-20-Mw Closed-Cycle Machine Set for Use With Nitrogen or Air


Fig. 28 Layout of 15-Mw Closed-Cycle Gas-Turbine Power Plant

The use of high-pressure gas having good thermodynamic and
nuclear properties results in a reactor of relatively small
dimensions with small pressure losses.


BIBLIOGRAPHY 


1 "The Escher Wyss-AK Closed-Cycle Turbine, Its Actual
Development and Future Prospects," by Curt Keller, Trans. ASME,
vol. 68, 1946, pp. 791-822.

2 "Closed-Cycle Gas Turbine, Escher Wyss-AK Development
1945-1950," by Curt Keller, Trans. ASME, vol. 72, 1950, pp. 835-850.

3 "The Closed-Cycle Gas Turbine Power Plant," by S. T.
Robinson, ASME Paper No. 52-A-137.

4 List of Licensees, end of 1956:

Great Britain: John Brown & Company (Clydebank) Ltd.,
Clydebank.

United States: American Turbine Corporation, New York
(Licensors of Escher Wyss in U. S. A.).
Westinghouse Electric Corporation, East Pittsburgh (Pa.).
Nordberg Manufacturing Company, Milwaukee, Wis.

Germany: AEG Allgemeine Elektricitaets-Gesellschaft, Berlin.
GHH Gutehoffnungshütte Sterkrade Aktiengesellschaft,
Werk Sterkrade, Oberhausen-Sterkrade.
Friedrich Krupp, Aktiengesellschaft, Essen.

Japan: Mitsubishi Zosen K.K., Tokyo.
Mitsui Shipbuilding & Engineering Co. Ltd., Tokyo.
Fuji Denki Seizo K.K., Tokyo.

5 "Compte rendu des essais officiels," by H. Quiby, Revue
Polytechnique Suisse, vol. 125, no. 23/24, June, 1945; also, Oil
Engine, vol. 13, November, 1945, pp. 184-191.

6 "Einige Probleme der warmfesten hitzebeständigen Stähle
vom Standpunkt des Verbrauchers," by W. Stauffer, Schweiz. Archiv
für angewandte Wissenschaft und Technik, heft 12, vol. 17, 1951, pp.
353-364.

"The Scaling Behavior of High-Strength Heat Resisting Steels in
Air and Combustion Gases," by W. Stauffer and H. Kleiber, Journal
of the Iron and Steel Institute, vol. 156, 1947, pp. 181-188.

7 "Closed-Cycle Air Turbine Installation for Marine
Propulsion," by Curt Keller and W. Spillmann, Oil Engine and Gas
Turbine, vol. 21, 1953, pp. 317-319; January, 1954, pp. 355-358. Also,
"Beispiele geschlossener Heissluft-Turbinenanlagen für Kriegs- und
Handelsschiffe," Jahrbuch der Schiffbautechn. Gesellschaft,
Hamburg, Germany, Bd. 48, 1954.

8 "Applied Atomic Energy," by Tom Sawyer, Prentice-Hall,
Inc., New York, N. Y., 1946, p. 141.

9 "Nuclear Reactors for Large Gas Turbines," by T. Jarvis, and
"The Closed-Cycle Nuclear Gas Turbine Power Plant," by S. T.
Robinson, ASME Symposium on Nuclear Gas Turbines, Washington,
D.C., 1956.

10 "Design Study 60 MW Closed-Cycle Gas Turbine Nuclear
Power Plant," by S. T. Robinson, American Turbine Corporation,
published by AEC, December, 1954.


Research on Application of Cooling
to Gas Turbines


By J. B. ESGAR,¹ J. N. B. LIVINGOOD,² and R. O. HICKEL,³ CLEVELAND, OHIO


The use of turbine cooling in gas-turbine engines can
offer many performance benefits but, at the same time, it
may result in added complication to the engine. The
advantages that turbine cooling can offer to the engine
designer and the results of some of the research that has
been expended on the cooling of gas-turbine engines are
discussed.


¹ Associate Chief, Turbine Cooling Branch, NACA Lewis Flight
Propulsion Laboratory.

² Head, Section A, Turbine Cooling Branch, NACA Lewis Flight
Propulsion Laboratory.

³ Head, Section B, Turbine Cooling Branch, NACA Lewis Flight
Propulsion Laboratory.

⁴ Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Gas Turbine Power Division and presented at
the Semi-Annual Meeting, Cleveland, Ohio, June 17-21, 1956, of The
American Society of Mechanical Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, May 4,
1956. Paper No. 56-SA-54.


INTRODUCTION


The application of turbine cooling to gas-turbine-type aircraft
engines permits increases in turbine-inlet temperature
to the point where engine power can be increased greatly.
For some applications the specific fuel consumption of the engine
also can be improved. Furthermore, the use of cooling will
permit increased allowable stress levels. As a result, it is possible to
increase the mass-flow rate through the turbine and, in all
probability, the turbine can be made more reliable. On the other
hand, the use of cooling introduces more complications into the
engine design and, for some modes of flight operation, high
turbine-inlet temperature causes poor fuel economy. Because it is
necessary to weigh the advantages and disadvantages of turbine
cooling as well as to consider many modes of flight and engine
operation, there is often considerable disagreement among
engine designers as to whether turbine cooling is really worth while.
The latest discussion presented to this Society on turbine cooling
by a member of the NACA staff was by O. W. Schey in 1948
(1).⁴ The present paper will try to show, within security
limitations, when it is desirable to utilize cooling in gas-turbine engines
and will present some of the results obtained at the Lewis Flight
Propulsion Laboratory from research directed towards the
application of cooling to gas turbines since the Schey paper (1) was
published.


Benefits From Turbine Cooling


Cooling of the turbine blades and disks by means of either air or
liquid results in degrees of freedom in engine design and operation
not possible with conventional types of engines. Some of the
things made possible through use of turbine cooling are as follows:

Increased Turbine-Inlet Temperature. The increase in
turbine-inlet temperature made possible by cooling the turbine causes
higher tail-pipe jet velocities and thus greater thrust for turbojet
engines and increased turbine-shaft power for engines such as the
turboprop. For either type of engine, the power per unit of
engine weight or per pound of air flow can be increased
substantially. This can be seen in Fig. 1. These performance
curves were calculated by the methods of (2) for a flight Mach
number of 0.9 and an altitude of 40,000 ft. A flight Mach number
of 0.9 is low for many turbojet-engine applications and is
somewhat high for present turboprop-engine applications. This
intermediate flight Mach-number value was selected, however, so
that the effects of turbine-inlet temperature on engine
performance could be compared at a single flight condition that is
reasonable for each engine type.


Fig. 1 Effect of Turbine-Inlet Temperature on Relative
Specific Thrust and Thrust Specific-Fuel Consumption.
Effects of Air Cooling Included. Flight Mach Number,
0.9; Altitude, 40,000 Ft
(Curves for turboprop, nonafterburning turbojet, and afterburning
turbojet engines; abscissa: turbine-inlet temperature, 1500 to 2500 F.)


The performance shown is relative to that of a nonafterburning
turbojet engine with a 1540 F turbine-inlet temperature. All
engines were assumed to have a compressor-pressure ratio of 12 and
compressor and turbine adiabatic efficiencies of 0.85. For the
afterburning turbojet engine, the gas temperature at the tail-pipe
nozzle was assumed to be 3000 F. In order to compare all engines
on the same basis, the shaft power of the turboprop engine was
converted to thrust by assuming that a propeller thrust was
added to the jet thrust. The effects of air-cooling the
turbine stator and rotor blades to a metal temperature of 1240 F are
included in the performance results shown. The cooling air was
assumed to be bled from the discharge of the compressor. After
cooling the blades, the air was considered to be mixed with the
exhaust gases downstream of the turbine. A blade-cooling
effectiveness parameter φ is used to show air-cooling effects on
performance. This parameter is defined as

\[ \varphi = \frac{T_{a,out} - T_{a,in}}{T_B - T_{a,in}} \qquad [1] \]

where
T_{a,in} = cooling-air inlet temperature at blade base
T_{a,out} = cooling-air outlet temperature at blade tip
T_B = average blade temperature

The blade-outside heat-transfer coefficients required to obtain
heat-rejection rates were found by the methods of (3). The value
of φ for the better air-cooled turbine blades approaches or exceeds
a value of 1.0. (The reason a value of 1.0 can be exceeded is that
the blade temperature near the blade tip is higher than the
average blade temperature.) Turbine blades with values of
φ = 0.5 would be of a poor cooling design.
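
As a numerical illustration of the effectiveness parameter, the short sketch below assumes the form φ = (Ta,out − Ta,in)/(TB − Ta,in), which is consistent with the three temperatures defined above and with the remark that φ can exceed 1.0 when the blade tip runs hotter than the blade average. The function name and the sample temperatures are hypothetical and are not taken from the paper.

```python
def cooling_effectiveness(t_air_in, t_air_out, t_blade_avg):
    """Blade-cooling effectiveness, assumed form (Ta,out - Ta,in) / (TB - Ta,in).

    All three temperatures must be on the same scale (for example, deg F).
    """
    if t_blade_avg <= t_air_in:
        raise ValueError("average blade temperature must exceed coolant inlet temperature")
    return (t_air_out - t_air_in) / (t_blade_avg - t_air_in)


# Hypothetical sample values in deg F (illustrative only, not data from the paper).
phi_good = cooling_effectiveness(t_air_in=500.0, t_air_out=1200.0, t_blade_avg=1240.0)
phi_poor = cooling_effectiveness(t_air_in=500.0, t_air_out=870.0, t_blade_avg=1240.0)
print(round(phi_good, 3), round(phi_poor, 3))
```

With these assumed numbers the first blade heats its cooling air nearly to the average blade temperature (φ close to 1.0), while the second uses only half of the available temperature difference (φ = 0.5), the level described above as a poor cooling design.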

From Fig. 1 it can be seen that the thrust of turboprop and 
nonafterburning turbojet engines can be increased greatly by in- 
creasing turbine-inlet temperatures. For the turboprop engine 
there is only a slight increase in specific fuel consumption. The 
specific fuel consumption for nonafterburning turbojet engines 
increases more with turbine-inlet temperature, but it is still more 
economical than using an afterburner. Afterburning to 3000 F 
results in the highest thrust by far for the three types of engines 
at low turbine-inlet temperatures, but the specific fuel consump- 
tion is very high. The use of higher turbine-inlet temperatures 
reduces the specific fuel consumption of afterburning engines and 
at the same time results in increased thrust. 

From a performance point of view, therefore, it appears that 
there are no disadvantages to increasing turbine-inlet tempera- 
tures for turboprop and afterburning turbojet engines. The cool- 
ing effectiveness of the turbine blades should be high, however, 
for the turboprop to get the greatest increases in power and, at the 
same time, not cause the specific fuel consumption to increase. 
For nonafterburning turbojet engines, Fig. 1 shows that in- 
creases in relative specific thrust are accompanied by marked 
increases in relative specific fuel consumption. The rate of in- 
crease in relative specific thrust, however, is about twice that of 
the increase in relative specific fuel consumption. As will be dis- 
cussed later, higher turbine-inlet temperatures at supersonic- 
flight Mach numbers can be desirable for nonafterburning engines 
when considered along with aircraft performance. 

The same general trends as those shown in Fig. 1 would be ob- 
tained with liquid cooling. A study of effects of liquid cooling on 
the efficiency of a turboprop-type engine is presented in (4). 

Increased Turbine Stress Levels. Since engine power is directly
proportional to the gas-flow rate, it is desirable to increase the
turbine-flow capacity by use of longer turbine blades. Longer
blades, in turn, result in higher turbine-blade stresses. Fig. 2
shows the relationship between allowable blade stress and blade
temperature in terms of the 100-hr stress-rupture properties of
three alloys.
The upper levels of the curves are terminated where stress-rupture
properties no longer determine the permissible stress. The alloy
S-816 is commonly used in present gas-turbine engines. At a
temperature of 1500 F (about standard blade temperature for
present engines), the maximum allowable stress is 24,000 psi. If,
however, the temperature is reduced only 100 deg F by cooling,
the allowable stress can be increased by about 35 per cent, with
further increases obtainable at lower temperatures. Below
temperatures of about 1200 F, however, other materials such as
A-286 possess better strength properties, with the possibility of
operating at stresses over 90,000 psi, over 3 1/2 times the
allowable stress for present engines. As a result, turbine-gas flow
capacity can be increased greatly. Turbine work capacity also
can be increased by operating at higher wheel speeds that are
possible with increased allowable stresses. Further reduction in
temperature makes possible the utilization of high-strength steels
such as Timken 17-22A(S). This type of material offers only
slight increases in possible operating stress over A-286, but the
critical material content of 17-22A(S) is eliminated almost
completely.

Fig. 2 Stress-Rupture Properties for Air-Cooled
Turbine-Blade [illegible]
(Ordinate: 100-hr stress-rupture, psi; abscissa: temperature, 1000 to
1600 F. Legend, alloy and per cent critical material: S-816, 97;
A-286, 46; third alloy [name illegible], 25.)
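
The stress figures quoted in this paragraph can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated in the text (24,000 psi for S-816 at 1500 F, a gain of about 35 per cent for a 100 deg F reduction, and 90,000 psi for A-286); the variable names are illustrative.

```python
# Values quoted in the text above.
BASE_STRESS_PSI = 24_000.0   # S-816 at 1500 F, 100-hr stress-rupture basis
GAIN_FOR_100F = 0.35         # "about 35 per cent" for a 100 deg F reduction
A286_STRESS_PSI = 90_000.0   # "over 90,000 psi" for A-286 below about 1200 F

# Allowable stress for S-816 after cooling the blade by 100 deg F.
cooled_stress_psi = BASE_STRESS_PSI * (1.0 + GAIN_FOR_100F)

# Ratio of the A-286 stress level to the present-engine allowable stress.
a286_ratio = A286_STRESS_PSI / BASE_STRESS_PSI

print(f"S-816 cooled 100 deg F: about {cooled_stress_psi:,.0f} psi")
print(f"A-286 at 90,000 psi: {a286_ratio:.2f} times the present allowable stress")
```

The ratio of 3.75 agrees with the statement that stresses over 90,000 psi are over 3 1/2 times the allowable stress for present engines.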

Increased Turbine Reliability. The design of turbine blades is 
unique in machine or structural design in that little or no factor of 
safety is employed. The blades are designed on a basis of stress- 
rupture properties for some specified life, which may be only 
from 100 to 300 hr with a design safety factor of 1.0. This low 
safety factor occurs because generally it would be desirable to 
operate engines at higher turbine-inlet temperatures than is 
possible without cooling to improve engine power. As a conse- 
quence, it has been necessary to operate uncooled engines at tem- 
peratures as high as possible, with a resultant sacrifice, to a con- 
siderable extent, in reliability. Cooling provides a means of in- 
creasing the factor of safety in turbine blades without lowering 
turbine-inlet temperatures and sacrificing engine performance. 
Fig. 3 shows the stress-rupture properties of A-286 for lives of 100
and 1000 hr. A tenfold increase in life can be obtained for blades
made of A-286 by decreasing the blade temperature by 50 to 90
deg F (depending on the stress level) through use of turbine
cooling. By such design methods, engine life and reliability could be
increased substantially.

Fig. 3 Stress-Rupture Properties of A-286 Alloy
(Curves for 100-hour and 1000-hour life; ordinate: rupture stress,
psi; abscissa: temperature, 1000 to 1500 F.)

A word of caution should be injected at this point, however. 
Some cooled-turbine research has been conducted which indicates 
that considerable potential for increased reliability exists, but 
much more work is required to verify this in extended actual en- 
gine and flight operation. Cooled turbine blades are constructed 
in a manner completely different from those blades that operate 
in current engines. As a result, considerably more blade develop- 
ment will be required before definite statements regarding im- 
provements in engine reliability can be made. 

Improved Aircraft Performance. One method of evaluating
aircraft performance is by the Breguet range equation, which can be
written

\[ R \propto \eta \, \frac{L}{D} \, \log_e \frac{W_g}{W_g - W_f} \qquad [2] \]

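Equation [2] can also be written as

\[ R \propto \eta \, \frac{L}{D} \, \log_e \frac{1}{\dfrac{W_p + W_s}{W_g} + \dfrac{W_e/F}{L/D}} \qquad [3] \]

where

R = range
η = over-all engine efficiency
L/D = lift-drag ratio
W_g = initial airplane gross weight
W_f = initial fuel weight
W_p = pay-load weight
W_s = structural weight
W_e/F = thrust specific engine weight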

So far as the engine is concerned, two terms in Equation [3]
affect aircraft range; namely, the over-all efficiency η and the
thrust specific engine weight W_e/F. At a given flight speed the
over-all efficiency is inversely proportional to the thrust specific
fuel consumption. In general the over-all efficiency increases with
increasing flight speed. The aircraft range improves with
increasing over-all efficiency and decreasing thrust specific engine
weight.
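
The trade between over-all efficiency and thrust specific engine weight can be illustrated numerically. The sketch below is only a relative-range estimate under stated assumptions: the gross weight is taken as the sum of fuel, pay load, structure, and engine weights, and thrust is taken equal to drag in cruise, so that the engine weight fraction is (W_e/F)/(L/D). The function name and all sample values are hypothetical and are not taken from the paper.

```python
import math


def breguet_range_factor(eta, lift_drag, payload_frac, structure_frac,
                         specific_engine_weight):
    """Relative range: eta * (L/D) * ln(W_g / (W_g - W_f)).

    Assumes W_g = W_f + W_p + W_s + W_e and thrust = drag = W_g / (L/D),
    so the engine weight fraction W_e / W_g equals (W_e/F) / (L/D).
    """
    engine_frac = specific_engine_weight / lift_drag
    nonfuel_frac = payload_frac + structure_frac + engine_frac
    if not 0.0 < nonfuel_frac < 1.0:
        raise ValueError("non-fuel weight fraction must lie between 0 and 1")
    return eta * lift_drag * math.log(1.0 / nonfuel_frac)


# Hypothetical comparison: raising turbine-inlet temperature lowers the
# over-all efficiency slightly but cuts the thrust specific engine weight.
base = breguet_range_factor(eta=0.25, lift_drag=5.0, payload_frac=0.10,
                            structure_frac=0.30, specific_engine_weight=1.0)
hot = breguet_range_factor(eta=0.23, lift_drag=5.0, payload_frac=0.10,
                           structure_frac=0.30, specific_engine_weight=0.6)
print(f"relative range, baseline engine: {base:.3f}")
print(f"relative range, high-temperature engine: {hot:.3f}")
```

With these assumed numbers the lighter engine more than offsets its lower efficiency; other combinations of lift-drag ratio and weight fractions can reverse the result, which is why the outcome depends on the airplane and the flight condition.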

The use of turbine cooling to increase stresses so that the 
mass-flow capacity of the engine can be increased will probably 
result in reduced specific engine weight. This would result in a 
beneficial effect on range and could be obtained without increas- 
ing turbine-inlet temperature. 

From Fig. 1 it can be seen that for a nonafterburning turbojet
engine the over-all engine efficiency will decrease as turbine-inlet
temperature is increased, but at the same time the thrust specific
engine weight also decreases because of the large increases in
thrust. There is a counterbalancing effect since both the
efficiency and the specific engine weight decrease simultaneously
in nonafterburning engines. Determining which effect will finally
predominate requires a lengthy study, and the result will vary with
different types of airplanes and engine designs. Without going into
the details, it can be stated that at high supersonic-flight speeds
and at very high altitudes, high turbine-inlet temperatures result in
increased aircraft range, because at these conditions the reduction
in specific engine weight more than outweighs the effect of
decreased efficiency.

It is obvious from Fig. 1 and Equation [3] that aircraft per- 
formance where turboprop or afterburning turbojet engines are 
used can be improved by increasing turbine-inlet temperatures. 
Performance gains, however, for afterburning engines are mar- 
ginal for turbine-inlet temperatures above about 2000 F. At 
higher flight Mach numbers slightly higher turbine-inlet tempera- 
tures are desirable. 


Design Considerations for Cooled Turbine Engines


In order to capitalize to the fullest extent on the performance 
benefits possible through use of turbine cooling the basic engine 
design should be different from the conventional practice in un- 
cooled engines. This means therefore that adapting uncooled 
engines so that the turbines can be cooled and the turbine-inlet 
temperature can be raised will not produce engines of the high 
performance possible if cooling is considered in the original con- 
cept of the engine design. 

Turbine-stress limitations encountered in uncooled engines are 
not valid for cooled engines. Because of much higher allowable 
stresses the turbines can be rotated faster and the blades made 
longer to permit use of higher turbine power and higher flow 
capacity for a given engine size. The compressor flow, size, and 
rotational speed are in turn affected by these possible changes in 
the turbine. By proper design the total engine power and
specific engine weight can be improved to a greater degree, by
building in all the advantages possible through use of turbine
cooling, than is possible by increasing turbine-inlet temperature
alone.


Application of Cooling to Turbines


From the preceding discussion, it can be seen that it would 
often be quite desirable to use turbine cooling to permit higher 
turbine-inlet temperatures or high stresses for both turbojet and 
turboprop engines. An evaluation of several means of turbine 
cooling will be discussed next to show how the turbine can be made 
to withstand these conditions. 


Fig. 4 Examples of Several Cooling Methods
((A) Convection cooling; (B) film cooling; (C) transpiration cooling;
(D) spray cooling; (E) natural convection cooling.)


Methods of Blade Cooling. Fig. 4 shows three methods of air 
cooling and two methods of liquid cooling. The most conven- 
tional air-cooling method used in heat-transfer processes is con- 
vection-cooling, Fig. 4(A). With this method, it is desirable to 
increase the heat-transfer-surface area on the heat-rejection side 
of the apparatus. The fins shown in Fig. 4(A) serve this purpose. 
This method of cooling has been used successfully on air-cooled 
piston engines for many years. Film cooling is illustrated in Fig. 
4(B). A film of cool air is introduced through slots to form an
insulating layer between the hot gases and the cooled surface. The
thermal conductivity of air is very low so that it is a good insula- 
tion medium, but the effectiveness of the layer of air is lost some 
distance downstream of the slot because of the mixing of the cool- 
ant with the hot gases. This disadvantage is eliminated by 
transpiration cooling, Fig. 4(C), because air is passed continuously 
through the entire area of a porous surface. Transpiration cooling 
is the most effective method of air-cooling known at the present 
time. A comparison of the effectiveness of these three methods of 
cooling is given in (5). 

Fig. 4(D) shows a form of liquid cooling where the liquid is
sprayed into the gas stream so that it will impinge upon and cool
the rotor blades. With this cooling device the liquid used for
the coolant is lost. A closed-type liquid-cooling system where the
coolant is recirculated also can be employed. This system is
similar to that used in automobiles. Several methods can be used
that will result in coolant circulation within the blades without
special coolant pumps. A natural-convection cooling system is
illustrated in Fig. 4(E). The natural-convection circulation can be
compared to that in a home hot-water tank where the heated water
rises to the top of the tank and the cooler water sinks to the
bottom. With the high centrifugal-force fields that are set up in
turbine rotors, the circulation rates due to natural convection
become very high. The liquid coolant can be circulated within a
single blind hole. In such a system, relatively cool liquid is forced
radially outward through the central portion of the hole by
centrifugal force. As the coolant near the wall is heated, its density
becomes less than that of the cool liquid core and it flows radially
inward. In a system where crossover holes are used, the coolant
would flow out one hole, cross over at the tip, and flow inward in
the next hole.

Air-Cooled Turbine Blades. Air-cooled turbine blades of about
2-in. chord utilizing the air-cooling methods shown in Figs. 4(A) to
(C) are illustrated in Fig. 5. The hollow blade was used by the
Germans in some of their engines in 1945. A survey of their work
on cooling turbojet engines and turbosuperchargers is given in (6)
to (8). The cooling effectiveness of the hollow blade is so low that
excessive quantities of cooling air are required; consequently,
efforts were made to provide added internal heat-transfer surface
area. The tube-filled blade, Fig. 5(B), was an early attempt of
the NACA to provide this extra surface area. Some results of
tests on and methods of manufacturing this type of blade are
given in (9) to (13). Although more recent blade developments
have led to superior blade configurations, much valuable
information has been obtained from the tube-filled configuration.

Fig. 5 Air-Cooled Turbine-Blade Configurations
((A) Hollow; (B) tube filled; (C) perforated; (D) film cooled leading
and trailing edges; (E) copper clad shell; (F) corrugated insert;
(G) strut supported; (H) transpiration cooled.)

The British (14) have used a somewhat different method of ap- 
proach to the problem of adding internal surface area. Instead of 
packing a hollow shell with tubes and brazing the assembly to- 
gether, holes were provided near the periphery of solid blades, 
Fig. 5(C), by drilling or other methods. Herein this type of blade 
will be called a perforated blade. 

Cooling of the leading and trailing edges is often difficult with
tube-filled blades. In an attempt to improve cooling effectiveness
in these regions, film cooling, Fig. 5(D), and copper-clad shells,
Fig. 5(E), were investigated. The copper-clad blade is a type of
structure similar to that of copper-clad kitchen utensils in which 
the copper spreads the heat over the entire area of the utensil. 
In the case of cooled turbine blades, the copper cladding is on the 
inner surface of the blade shell. Results of heat-transfer tests on 
film-cooled and copper-clad blades are reported in (15) to (17). 

The reductions obtained in the temperatures of turbine-blade
leading and trailing edges are shown by the experimental data in Fig.
6. It will be noted that the blade-temperature gradients can be
reduced by use of film-cooled and copper-clad blades, but the
leading and trailing-edge cooling is done at the expense of increased
temperatures in the midchord region of the blades. The
film-cooled blade shown operated only slightly cooler at the leading
edge than did the 10-tube blade. The amount of cooling
obtained in this region is a function of the configuration and the
relative pressures on the inside and outside of the blade. At
higher flow rates this cooling becomes more effective because a
greater portion of the air is bled from the slots. The blade must
be designed specifically for the application for which it is intended
and the coolant supply pressure must be higher than the
stagnation pressure at the blade leading edge. More effective
leading-edge cooling has been found in other film-cooled blades.
Durability of this type of blade was found to be a serious problem as
reported in (12). Research was conducted in Germany on blades
having film cooling around the complete periphery (18), and
similar blades have been built in this country. Although cooling is
adequate for some cases, durability of rotor blades is usually poor.
This type of blade probably would be more successful when
applied to stators, provided that cooling-air pressure is sufficiently
high.

Although the cooling of the copper-clad blade looks better
than for the film-cooled blade in Fig. 6, the weight increase due
to the addition of copper raises the stress for rotor blades so much
that the gains from cooling are practically eliminated. In
addition, at the blade temperatures required in engine operation,
copper oxidation is rapid and severe.

Fig. 6 Effect of Special Methods of Cooling Leading and
Trailing Edges on Temperature
(Engine speed, 10,000 rpm; effective gas temperature, 1070 F; cooling-air
temperature, 130 F; ratio of cooling-air flow to combustion-gas flow,
0.045. Curves for 10-tube, film-cooled, and copper-clad blades;
abscissa: chordwise distance, per cent, leading edge to trailing edge.)

A better method of reducing chordwise temperature gradients 
for rotor blades is to utilize thinner blade shells. The leading and
trailing edges would then have larger cooling-air passages which 
would insure an adequate supply of cooling air in those regions. 
Configurations that can extend the augmented heat-transfer sur- 
face well into the leading and trailing edges also should be used. 
Such a blade, Fig. 5(F), is one with corrugated fins (11). An 
island is usually provided in the middle of the coolant passage so 
that the corrugations can be of uniform amplitude. The core of 
the island is blocked off from the cooling air. 

In all the turbine blades discussed up to this point, with the 
possible exception of the perforated blade, the blade shell has been 
the primary member for carrying the stresses due to centrifugal 
forces. Since the shell is exposed to the gas stream, it is also the 
hottest member of the blade; therefore its stress-carrying ca- 
pacity is lower than that of cooler portions of the blade. For this 
reason it seems practical to design blades where the main stress- 
carrying member, or strut, is submerged inside the coolant pas- 
sage (19), where it will be at a lower temperature than the blade 
shell, Fig. 5(G). This type of construction can be used with
either an impermeable or a permeable shell; with a permeable
shell the blade is called transpiration-cooled, Fig. 5(H). More of
the details of this type of construction are illustrated in Fig. 7.
The porous shell could be made from several materials, the
most probable being woven-wire cloth (20) to (22) or porous
sintered materials made from powdered metal (23).

Fig. 7 Transpiration-Cooled, Strut-Supported Blade

The design and manufacture of transpiration-cooled blades are
somewhat difficult because the coolant-flow rate is a function of
permeability of the shell and the pressures inside and outside of the
blade. Because of the high pressure gradients on the outside of the
shell, large variations in shell permeability are required. Some of
the advantages and problems of transpiration cooling are discussed
in (24), and methods of accounting for rotational effects on
cooling-air distribution are given in (25).

Many analytical studies have been made on methods of
calculating cooling-air requirements for the various types of blades
shown in Fig. 5. Calculated coolant flows for five of the blades for
a range of turbine-inlet gas temperatures are shown in Fig. 8.
These flows are on a relative basis because the absolute magnitude
of the flows is dependent upon many conditions, such as the type
of engine, size of blade, flight speed, and flight altitude. A review
of blade-temperature calculation methods is given in (26). More
specifically, the coolant flows for the hollow, tube-filled, and
corrugated-insert blade were calculated by methods presented in
(27) to (29), the strut blade by (19) and the porous blade by
(24), (30), and (31). The load-carrying member of all blades
was assumed to be cooled to 1240 F.

Fig. 8 Comparison of Relative Coolant-Flow Requirements
for Several Types of Air-Cooled Turbine Blades
(Curves include hollow, strut, and porous blades; abscissa:
turbine-inlet temperature, 1600 to 2600 F.)

The use of plain hollow blades is impractical for cooled turbines 
because the coolant-flow requirements are exorbitant. On the 
other extreme is the transpiration-cooled turbine blade which 
ideally requires only very small amounts of coolant even for large 
increases in turbine-inlet temperature. Generally, it is desirable 
to use blades requiring the smallest coolant flow, but other
factors such as fabrication problems, cooling-air pressure required,
clogging (such as may be encountered with transpiration cooling),
blade weight, and durability also must be considered. As a
result, the final choice of the blade design is left to the designer's
discretion and will depend upon the over-all design and
proposed application of the engine.

Air-Cooled Turbine-Disk Configurations. The use of air-cooled
turbine blades will require a type of turbine-rotor construction
different from that in current use. There is, however, a
considerable amount of freedom in the type of design possible. Two main
types of turbine rotor are the split disk, Fig. 9, and the shrouded
disk, Fig. 10. A discussion of these disk configurations is given
in (32). With either type of construction the cooling air can be
supplied from the upstream direction, the downstream direction,
or through a hollow turbine shaft. With any of these possible
types of construction, internal vanes are required in the turbine
rotor to direct and help pump the cooling air out to the blades.
Up to the present time, experimental tests have been conducted
on several turbines with the type of disk configuration shown
in Fig. 9(b), and some of the results presented in (32) to (34)
indicate that disk cooling will be adequate with the amount of air
required for blade cooling.

Fig. 9 Split-Disk-Type, Air-Cooled, Turbine Configurations
((a) Upstream inlet; (b) downstream inlet; (c) hollow shaft inlet.)

Fig. 10 [Shrouded] Air-Cooled, Turbine-Disk Configurations
((a) Upstream inlet; (b) downstream inlet; (c) hollow shaft inlet.)

Liquid Cooling. Internal water-cooled turbines have been 
operated in this country and in Europe (35) to (37). This 
method of operation was logical because of the excellent heat- 
transfer characteristics of water. Water as a turbine coolant, 
however, has one very serious disadvantage—the boiling point 
is so low that unless the entire coolant system is under very 
high pressurization the turbine is overcooled and the heat-rejection
rates are excessive. A further disadvantage occurs at high
flight speeds because the ram-air temperature exceeds the boil- 
ing temperature of water at normal pressures, and heat rejection 
in a ram-air heat exchanger becomes difficult. The ram-air tem- 
perature reaches 212 F at a flight Mach number of about 1.2 
at standard sea-level conditions and at a Mach number of about 
1.9 in the stratosphere. 
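
The two Mach numbers quoted above can be checked with the adiabatic stagnation-temperature relation $T_0 = T\,[1 + \tfrac{k-1}{2} M^2]$. The short Python sketch below is only an illustration: it assumes $k = 1.4$, a standard sea-level static temperature of 518.67 R, and a stratosphere static temperature of 389.97 R, none of which are stated in the paper, and the function name is hypothetical.

```python
import math

K = 1.4                    # assumed isentropic exponent for air
T_BOIL_R = 212.0 + 459.67  # boiling point of water at 1 atm, deg R

def mach_for_ram_temperature(t_static_r, t_ram_r=T_BOIL_R, k=K):
    """Flight Mach number at which the ram (stagnation) temperature equals t_ram_r."""
    return math.sqrt((t_ram_r / t_static_r - 1.0) * 2.0 / (k - 1.0))

# Assumed standard-atmosphere static temperatures, deg R
mach_sea_level = mach_for_ram_temperature(518.67)
mach_stratosphere = mach_for_ram_temperature(389.97)
print(round(mach_sea_level, 2), round(mach_stratosphere, 2))
```

Under these assumptions the two values come out close to the 1.2 and 1.9 given in the text.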

Pressurization of the entire coolant system would offer relief, 
but in order to pressurize the system to a point where water 
could be at a temperature that would not result in overcooling 
of the turbine, the system would have to be pressurized to ap- 
proximately 3000 psi. A method of eliminating this difficulty 
would be to use special coolants such as Dowtherm or liquid 
metals which have higher boiling points than water at normal 
pressure levels. The design of liquid-coolant systems requires a 
knowledge of heat-transfer coefficients for forced and free convection.
Considerable experimental and analytical heat-transfer data, (38) to
(41), have been obtained that are applicable to liquid-cooled
turbines.

The British are giving thought to the use of a thermosiphon 
liquid-coolant system for turbine blades (42) and (43). In this 
type of system, a small amount of coolant is placed in the tur- 
bine—not enough to fill the blade-coolant passages. The coolant 
evaporates in the turbine blades. The vapor is passed over a 
heat exchanger and is condensed. The condensate then flows 
back to the turbine blades to be evaporated again. This system 
holds promise for some applications. Its primary advantage over 
other types of liquid-coolant systems is that the internal pres- 
sures in the blades can be considerably lower. 

Water-spray cooling as discussed in (44) offers possibilities for 
certain limited applications where a boost in power is required 
for short periods of time. Running with this method for prolonged
periods of time results in excessive liquid consumption in the
engine because the water used as the coolant is lost. Using this
method of cooling the rotor blades alone permits increasing the 
turbine-inlet temperature to the point where the stator blades 
provide a limitation. For relatively small rotor blades, the tem- 
peratures can be reduced adequately by spraying water onto the 
blade from a few (varying from 1 to 8) stationary circumferential
locations. However, because of the low water velocities rela- 
tive to the gas and turbine-wheel velocities, the coolant supplied 
from stationary locations only impinges on the blades near the 
leading edges, and cooling may not be provided at the trailing 
edges of large rotor blades. A “rotating gutter” on the rotor as
suggested in (44) could supply coolant to the trailing edges. 

Providing spray cooling to the stator blades, so that higher
turbine-inlet temperatures could be obtained than are possible
with rotor cooling alone, would require a more complicated
water-spray system, and, of course, the coolant flow would
have to be increased. The total flow would be about 2 to 4 times
the weight-flow rate of fuel required.

Consideration has been given to fuel injection ahead of the 
turbine for afterburner use (45). Although very few experi- 
mental temperature data are available, it is not believed that the 
quantities of fuel required for suitable afterburning temperatures 
would be adequate for use in cooling of the blades; water would 
still have to be injected to augment the cooling from the fuel 
spray. 

The time that water-spray cooling would probably be most 
beneficial would be during take-off of aircraft having marginal 
take-off power. For such a case, the turbine-inlet temperature 
could be raised a few hundred degrees. The water that would 
be used for cooling would be expended during the take-off run, 
and the weight of the coolant would not have to be carried during 
the remainder of the flight. This method of cooling would also 
be useful for short bursts of emergency power during flight. 




In conclusion it can be stated that, for many applications of 
gas-turbine engines, turbine cooling can offer substantial perfor- 
mance benefits. For these cases there are many methods of 
turbine cooling that are possible, leaving considerable choice to 
the engine designer. 


BIBLIOGRAPHY 


1 “The Advantages of High Inlet Temperature for Gas Turbines
and Effectiveness of Various Methods of Cooling of Blades,” by O.
W. Schey, ASME Paper No. 48-A-105.

2 “Methods for Rapid Graphical Evaluation of Cooled or Uncooled
Turbojet and Turboprop Engine or Component Performance
(Effects of Variable Specific Heat Included),” by J. B. Esgar and R.
R. Ziemer, NACA TN 3335, 1955.

3 “Extension of Boundary-Layer Heat-Transfer Theory to
Cooled Turbine Blades,” by W. B. Brown and P. L. Donoughe,
NACA RM E50F02, 1950.

4 “Effect of Turbine-Blade Cooling on Efficiency of a Simple Gas
Turbine Power Plant,” by W. M. Rohsenow, Trans. ASME, vol.
78, 1956, pp. 1787-1794.

5 “Comparison of Effectiveness of Convection-, Transpiration-,
and Film-Cooling Methods With Air as Coolant,” by E. R. G. Eckert
and J. N. B. Livingood, NACA Report 1182, 1954.

6 “A Survey of German Hollow Turbine Blade Development.
Part I - Initial Investigations and Developments,” by E. N. Petrick,
Purdue University, Purdue Research Foundation, published by
USAF-AMC, Wright-Patterson Air Force Base, Dayton, Ohio,
October, 1949. (USAF Contract W33-038-ac-17625.)

7 “A Survey of German Hollow Turbine Blade Development.
Part II - The Design Features and the Production of Hollow Turbine
Blades and Hollow Turbine Rotors for Turbosuperchargers and
Turbojet Engines,” by E. N. Petrick, Purdue University, Purdue
Research Foundation, published by USAF-AMC, Wright-Patterson Air
Force Base, Dayton, Ohio, December, 1949. (USAF Contract
W33-038-ac-17625.)

8 “A Survey of German Hollow Turbine Blade Development.
Part III - Turbine Blade Vibrations; Considerations on the
Improvement of Hollow Blade Designs,” by E. N. Petrick, Purdue
University, Purdue Research Foundation, published by USAF-AMC,
Wright-Patterson Air Force Base, Dayton, Ohio, March, 1950. (USAF
Contract W33-038-ac-17625.)

9 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. I - Rotor Blades With 10 Tubes in Cooling-Air
Passages,” by H. H. Ellerbrock, Jr. and F. S. Stepka, NACA RM
E50I04, 1950.

10 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. III - Rotor Blades With 34 Steel Tubes in
Cooling-Air Passages,” by R. O. Hickel and G. T. Smith, NACA RM
E50J06, 1950.

11 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. VII - Rotor-Blade Fabrication Procedures,” by
R. A. Long and J. B. Esgar, NACA RM E51E23, 1951.

12 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. IX - Evaluation of the Durability of Noncritical
Rotor Blades in Engine Operation,” by F. S. Stepka and R. O.
Hickel, NACA RM E51J10, 1951.

13 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. X - Endurance Evaluation of Several Tube-Filled
Rotor Blades,” by J. B. Esgar and J. L. Clure, NACA RM
E52B13, 1952.

14 “An Experimental Single-Stage Air-Cooled Turbine. II -
Research on the Performance of a Type of Internally-Air-Cooled
Turbine Blade,” by D. G. Ainley, Aircraft Engineering, vol. 25,
September, 1953, pp. 269-276.

15 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. IV - Effects of Special Leading- and Trailing-Edge
Modifications on Blade Temperature,” by H. H. Ellerbrock, Jr., C. F.
Zalabak, and G. T. Smith, NACA RM E51A19, 1951.

16 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. VI - Conduction and Film Cooling of Leading and
Trailing Edges of Rotor Blades,” by V. L. Arne and J. B. Esgar,
NACA RM E51C29, 1951.

17 “Experimental Investigation of Air-Cooled Turbine Blades in
Turbojet Engine. VIII - Rotor Blades with Capped Leading Edges,”
by G. T. Smith and R. O. Hickel, NACA RM E51H14, 1951.

18 “Temperature Measurement on Two Stationary Bucket Profiles
for Gas Turbines with Boundary-Layer Cooling,” by K. H.
Kuepper, Trans. No. F-TS-1543-RE, Air Materiel Command, U. S.
Air Force, January, 1948. (ATI No. 18576, CADO).




19 “Use of Electric Analogs for Calculation of Temperature of
Cooled Turbine Blades,” by H. H. Ellerbrock, Jr., E. F. Schum, and
A. J. Nachtigall, NACA TN 3060, 1953.

20 “Wire Cloth as Porous Material for Transpiration-Cooled
Walls,” by E. R. G. Eckert, M. R. Kinsler, and R. P. Cochran, NACA
RM E51H23, 1951.

21 “Experimental Investigation of Air-Flow Uniformity and
Pressure Level on Wire Cloth for Transpiration-Cooling Applications,”
by P. L. Donoughe and R. A. McKinnon, NACA TN 3652, 1956.

22 “New Sintered-Wire Porous Metal Shows Promise for Aviation
Use,” by Irving Stone, Aviation Week, vol. 63, July 11, 1955, pp.
65-69.

23 “Experimental Investigation of Coolant-Flow Characteristics
of a Sintered Porous Turbine Blade,” by E. R. Bartoo, L. J. Schafer,
Jr., and H. T. Richards, NACA RM E51K02, 1952.

24 “Survey of Advantages and Problems Associated with
Transpiration Cooling and Film Cooling of Gas-Turbine Blades,” by E. R.
G. Eckert and J. B. Esgar, NACA RM E50K15, 1951.

25 “One-Dimensional Calculation of Flow in a Rotating Passage
with Ejection Through a Porous Wall,” by E. R. G. Eckert, J. N. B.
Livingood, and E. I. Prasse, NACA TN 3408, 1955.

26 “Some NACA Investigations of Heat-Transfer Characteristics
of Cooled Gas-Turbine Blades,” by H. H. Ellerbrock, Jr., paper
presented at General Discussion on Heat Transfer, The Institution of
Mechanical Engineers (London) and ASME (New York), Conference
(London), September 11-13, 1951.

27 “Analysis of Spanwise Temperature Distribution in Three
Types of Air-Cooled Turbine Blades,” by J. N. B. Livingood and W.
B. Brown, NACA Report 994, 1950.

28 “Procedure for Calculating Turbine Blade Temperatures and
Comparison of Calculated With Observed Values for Two Stationary
Air-Cooled Blades,” by W. B. Brown, H. O. Slone, and H. T.
Richards, NACA RM E52H07, 1952.

29 “A Summary of Basic Heat Transfer and Flow Friction Design
Data for Plain Plate-Fin Heat Exchanger Surfaces,” by W. M. Kays
and S. H. Clark, Technical Report No. 17, prepared under Contract
N6-ONR-251 Task Order 6 (NR-090-104) for Office of Naval
Research, Department of Mechanical Engineering, Stanford University,
Stanford, Calif., August 15, 1953.

30 “A Simplified Theory of Porous Wall Cooling,” by W. D.
Rannie, Progress Report No. 4-50, Power Plant Laboratory Project
No. MXS801, Jet Propulsion Laboratory, C.I.T., November 24, 1947
(AMC Contract No. W-535-ac-20260, Ordnance Department
Contract No. W-04-200-ORD-455).

31 “A Theoretical and Experimental Investigation of Rocket-Motor
Sweat Cooling,” by Joseph Friedman, Journal of the American
Rocket Society, No. 79, December, 1949, pp. 147-154.




32 “Investigations of Air-Cooled Turbine Rotors for Turbojet
Engines. II - Mechanical Design, Stress Analysis, and Burst Test of
Modified J33 Split-Disk Rotor,” by R. H. Kemp and M. L. Moseson,
NACA RM E51J03, 1952.

33 “Investigations of Air-Cooled Turbine Rotors for Turbojet
Engines. I - Experimental Disk Temperature Distribution in
Modified J33 Split-Disk Rotor at Speeds up to 6000 Rpm,” by W. B.
Schramm and R. R. Ziemer, NACA RM E51I11, 1952.

34 “Investigations of Air-Cooled Turbine Rotors for Turbojet
Engines. III - Experimental Cooling-Air Impeller Performance and
Turbine Rotor Temperatures in Modified J33 Split-Disk Rotor up to
Speeds of 10,000 Rpm,” by A. J. Nachtigall, C. F. Zalabak, and
R. R. Ziemer, NACA RM E52C12, 1952.

35 “Heat-Transfer and Operating Characteristics of Aluminum
Forced-Convection and Stainless-Steel Natural Convection
Water-Cooled Single-Stage Turbines,” by J. C. Freche and A. J. Diaguila,
NACA RM E50D03a, 1950.

36 “The Possibilities of the Gas Turbine for Aircraft Engines,” by
E. Schmidt, Reports and Transactions No. 489, GDC 2504T, British
M.O.S.

37 “Application of Internal Liquid Cooling to Gas-Turbine
Rotors,” by S. Alpert, R. E. Grey, and D. D. Drake, Trans. ASME, vol.
78, 1956, pp. 1257-1266.

38 “Experiments on Mixed-, Free- and Forced-Convective Heat
Transfer Connected with Turbulent Flow Through a Short Tube,” by
E. R. G. Eckert, A. J. Diaguila, and A. N. Curren, NACA TN 2974,
1953.

39 “Convective Heat Transfer for Mixed, Free, and Forced Flow
Through Tubes,” by E. R. G. Eckert and A. J. Diaguila, Trans.
ASME, vol. 76, 1954, pp. 497-504.

40 “Free-Convection Effects on Heat Transfer for Turbulent
Flow Through a Vertical Tube,” by E. R. G. Eckert, A. J. Diaguila,
and J. N. B. Livingood, NACA TN 3584, 1955.

41 “Review of Experimental Investigations of Liquid-Metal
Heat Transfer,” by Bernard Lubarsky and S. J. Kaufman, NACA TN
3336, 1955.

42 “Heat-Transfer Problems of Liquid-Cooled Gas-Turbine
Blades,” by Henry Cohen and F. J. Bayley, paper for presentation to
The Institution of Mechanical Engineers, London, England, 1955.

43 “Free Convection in an Open Thermosiphon, with Special
Reference to Turbulent Flow,” by B. W. Martin, Proceedings of the
Royal Society of London, series A, vol. 230, 1955, pp. 502-530.

44 “A Novel Cooling Method for Gas Turbine,” by Edward Burke
and G. A. Kemeny, Trans. ASME, vol. 77, 1955, pp. 187-195.

45 “Turbojet Afterburning Without Any Afterburner,” by H. E.
Schmitt, Aeronautical Engineering Review, vol. 9, December, 1950, pp.
18-24.




Generalized Optimal Heat-Exchanger Design¹


By D. H. FAX² and R. R. MILLS, JR.³


There are many different ways of formulating the prob- 
lem of the design of an optimum heat exchanger (or of 
any plant element or ensemble of elements) depending on 
what measure of performance is to be extremized and un- 
der what constraints the optimum is to be effected. A 
large number of such solutions exist in the literature but 
they often appear to have little in common, each variation 
in constraints apparently generating a brand new prob- 
lem. Based on the use of Lagrangian multipliers, a gen- 
eralized method is here developed which shows perhaps 
more clearly than before the interrelationship between 
different problems of this class. The method is particu- 
larly useful in those problems where several variables are 
to be optimized simultaneously. In the paper, the method 
is exemplified by the solution of three problems in gas- 
turbine regenerator design, at least two of which are be- 
lieved to be new. 


NOMENCLATURE 
The following nomenclature is used in the paper: 


$A$ = heat-transfer area, sq ft
$c$ = ratio of air-side free-flow area to air-side frontal area, see Fig. 1
$c_p$ = constant-pressure specific heat, Btu/lb F
$c_1, c_2, c_3, c_4, c_5$ = functions of $\chi$, $\tau$, $\eta_T$, and $\eta_c$, defined in Equations [25] through [29]
$d$ = hydraulic diameter of flow passages
$E_t$ = cycle thermal efficiency
$f$ = Fanning friction factor
$F$ = ratio of heat-exchanger mean temperature difference to that which would obtain in counterflow
$g$ = 32.174 × (3600)² lb ft/hr² lbf
$G$ = mass velocity, $W/S$, lb/hr sq ft
$h$ = heat-transfer coefficient, Btu/hr sq ft F
$j$ = $(h/Gc_p)\,Pr^{2/3}$
$J$ = Jacobian operator
$k$ = isentropic exponent
$M_a$, $M_g$ = Mach number based on frontal areas, for air and gas sides, respectively
NTU = $hA/Wc_p$, number of transfer units, with subscript denoting air or gas side
$p$ = pressure, psf, with subscripts as denoted in Fig. 2
$(\Delta p/p)_{a,g,c}$ = fractional pressure drops on air and gas sides of regenerator and in combustor
$(\Delta p/p)_T$ = total fractional pressure drop, defined in Equation [11]
$Pr$ = Prandtl number
$S$ = free-flow area, sq ft
$T$ = temperatures, deg R, with subscripts as denoted in Fig. 2
$v$ = specific volume, cu ft/lb
$W$ = mass flow rate, lb/hr
$x$ = $1 - \epsilon$
$y$ = $\ln x$
$X, Y, Z$ = regenerator dimensions, ft, as defined in Fig. 1
$\Gamma$ = a dimensionless grouping defined in Equation [34]
$\epsilon$ = regenerator effectiveness
$\eta_c$, $\eta_T$ = adiabatic efficiencies of compressor and turbine
$\lambda$ = a Lagrangian multiplier
$\tau$ = $T_4/T_1$, ratio of turbine-inlet to compressor-inlet temperatures
$\chi$ = $(p_2/p_1)^{(k-1)/k}$, compressor isentropic temperature ratio


1 Based on an essay submitted to the Faculty of Engineering of The
Johns Hopkins University, Baltimore, Md., in partial fulfillment of
the requirements for the degree MSc Eng., May, 1954.

2 Advisory Engineer, Commercial Atomic Power Activity,
Westinghouse Electric Corporation, Pittsburgh, Pa.; formerly, Assistant
Professor of Mechanical Engineering, The Johns Hopkins University,
Baltimore, Md. Assoc. Mem. ASME.

3 Research Assistant in Mechanical Engineering, The Johns
Hopkins University, Baltimore, Md. Assoc. Mem. ASME.

Contributed by the Heat Transfer Division and presented at a
joint session with the Gas Turbine Power Division at the Semi-Annual
Meeting, Cleveland, Ohio, June 17-21, 1956, of The American
Society of Mechanical Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters,
November 16, 1955. Paper No. 56-SA-19.

INTRODUCTION 


A number of writers have treated the problem of the design of 
optimal heat exchangers, a few of which are given in references 
(1-4).⁴ The problems differ according to what measure of performance
is to be extremized and under what types of constraints
the optimum is to be effected. A study of the literature may give 
the impression that these problems have very little in common and 
that each additional variation in the constraints must perforce 
generate an entirely new problem. 

Lagrange’s method of multipliers is used in many branches of 
applied mathematics and physics to deal with the problem of ex- 
tremizing a function of a number of variables which are, in turn, 
related by a number of equations of constraint. It is the purpose 
of this paper to show the application of this method to the optimal 
design of gas-turbine regenerators, and in particular to show the 
advantages provided by the method in revealing the points of 
similarity in a class of related problems. 


THE METHOD OF LAGRANGIAN MULTIPLIERS


Although a derivation of this method can be found in a number 
of places (e.g., 5, 6, 7), the form of the result to be used here is 
sufficiently different to warrant a special derivation. 

Given a function of $n$ variables

$$f = f(x_1, x_2, \ldots, x_n) \qquad [1]$$

not all of which are independent but are related by $m$ equations
of constraint

$$\psi_j(x_1, x_2, \ldots, x_n) = 0 \qquad [2]$$

$j = 1, 2, \ldots, m$; $m < n$. It is presumed that $f$ and all the $\psi_j$ are
differentiable throughout the range of interest.

The “conventional” method of extremizing the function $f$,
presuming an extremum to exist, is to combine the $\psi$'s with $f$ so as to
eliminate $m$ of the $x$'s, leaving $f$ as a function of $n - m$ independent
variables, then to equate the partial derivative of $f$ with respect to
each of the remaining $x$'s to zero, and finally to solve this set of
simultaneous equations for the optimal $x$'s. This method can
lead to some awkward algebra, particularly when the $\psi$'s involve
transcendental functions; more important, it can obscure the
relationships among a class of problems which differ in only one
(or a few) of the equations of constraint. The method to be
presented frequently can obviate, or at least reduce, both of these
difficulties.

4 Numbers in parentheses refer to the Bibliography at the end of the
paper.

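A necessary condition for an extremum in $f$ is that, at the
extremum

$$df = \sum_{i=1}^{n} \frac{\partial f}{\partial x_i}\,dx_i = 0 \qquad [3]$$

Since the $\psi$'s are equal to zero

$$d\psi_j = \sum_{i=1}^{n} \frac{\partial \psi_j}{\partial x_i}\,dx_i = 0 \qquad [4]$$

Multiplying each $d\psi_j$ by $\lambda_j$, the Lagrangian multipliers, and
adding all these products to Equation [3]

$$\sum_{i=1}^{n} \frac{\partial f}{\partial x_i}\,dx_i + \sum_{j=1}^{m} \lambda_j \sum_{i=1}^{n} \frac{\partial \psi_j}{\partial x_i}\,dx_i = 0$$

which, upon rearrangement, reads

$$\sum_{i=1}^{n} \left[ \frac{\partial f}{\partial x_i} + \sum_{j=1}^{m} \lambda_j \frac{\partial \psi_j}{\partial x_i} \right] dx_i = 0 \qquad [5]$$

Of the $n$ brackets in Equation [5], $m$ can be made zero by
suitable choice of the $m$ $\lambda$'s; the remaining $n - m$ brackets also
must be separately zero since $n - m$ of the $x$'s are independent.
Consequently we have $n$ equations of the type

$$\frac{\partial f}{\partial x_i} + \sum_{j=1}^{m} \lambda_j \frac{\partial \psi_j}{\partial x_i} = 0, \quad i = 1, 2, \ldots, n \qquad [6]$$

which, together with the $m$ Equations of Constraint [2], can be
solved explicitly for the $n$ optimal $x$'s and the $m$ $\lambda$'s.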

The elimination of the $m$ $\lambda$'s can be performed at once.
Choosing any $m$ of Equations [6], say, the first $m$, and solving in the
usual manner, we obtain

$$\lambda_j = -\,\frac{J\!\left(\dfrac{\psi_1, \ldots, \psi_{j-1}, f, \psi_{j+1}, \ldots, \psi_m}{x_1, \ldots, x_m}\right)}{J\!\left(\dfrac{\psi_1, \ldots, \psi_m}{x_1, \ldots, x_m}\right)} \qquad [7]$$

where $J$ denotes the Jacobian determinantal operator. It is to be
noted that in order to effect these solutions, the denominator of
Equation [7], common to all $m$ of the $\lambda$'s, must not be identically
zero. This condition, that the $m$th-order determinant

$$J\!\left(\frac{\psi_1, \ldots, \psi_m}{x_1, \ldots, x_m}\right) \neq 0 \qquad [8]$$

limits the choice as to which set of $m$ of the $n$ Equations [6] can
be used to solve for the $\lambda$'s. Now it can be shown that Equation
[8] is both a necessary and sufficient condition that the equations
of constraint be functionally independent with respect to the
variables $x_1, \ldots, x_m$. Hence, if no set of $m$ of the $x$'s can be chosen
so as to satisfy the Inequality [8], then at least one of the
equations of constraint can be formed by some combination of the
others. If this be so, then the problem stated by Equations [1]
and [2] must be restated in terms of (at least) one less $\psi$ and
(at least) one more degree of freedom.

In order to exclude the trivial case $\lambda_j = 0$, the numerators of
Equations [7] cannot all be zero. Thus it is further necessary
that the $m$ $x$'s be chosen such that not all of the $\partial f/\partial x_i$ are
identically zero.

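Having satisfied these two conditions, the solutions for the $\lambda$'s
may be substituted into the remaining $n - m$ of Equations [6], a
typical one of which will look like

$$J\!\left(\frac{\psi_1, \ldots, \psi_m}{x_1, \ldots, x_m}\right) \frac{\partial f}{\partial x_r} - \sum_{j=1}^{m} J\!\left(\frac{\psi_1, \ldots, \psi_{j-1}, f, \psi_{j+1}, \ldots, \psi_m}{x_1, \ldots, x_m}\right) \frac{\partial \psi_j}{\partial x_r} = 0 \qquad [9]$$

where $r = m + 1, m + 2, \ldots, n$. This form is recognized as the
expansion by cofactors of the $(m + 1)$th-order determinant

$$J\!\left(\frac{f, \psi_1, \ldots, \psi_m}{x_1, \ldots, x_m, x_r}\right) = 0 \qquad [10]$$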

The solution for the $n$ optimal $x$'s is given by the $m$ equations
of constraint and the $n - m$ determinantal Equations [10]. The
$\lambda$'s do not appear explicitly in the final formulation of the
method. The algebra required to solve the n simultaneous equa- 
tions may still be quite tedious. Nevertheless, the present 
method should be simpler than the conventional approach in 
many cases because the final solution can always be effected, if 
necessary, by numerical or graphical means and no calculus 
operations remain to be performed on the solutions so found. 
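
As a numerical illustration of the determinantal condition, the Python sketch below applies it to a small made-up problem: extremize $f = x_1^2 + 2x_2^2 + 3x_3^2$ subject to the single constraint $\psi = x_1 + x_2 + x_3 - 1 = 0$. Both functions, the choice of $x_1$ as the variable satisfying Inequality [8], and all names are hypothetical and are not taken from the paper. With $m = 1$ each determinant is of second order, and setting the two of them to zero gives $x_2 = x_1/2$ and $x_3 = x_1/3$.

```python
def grad(func, x, h=1e-6):
    """Central-difference gradient of func at the point x (list of floats)."""
    g = []
    for i in range(len(x)):
        xp, xm = list(x), list(x)
        xp[i] += h
        xm[i] -= h
        g.append((func(xp) - func(xm)) / (2.0 * h))
    return g

def f(x):
    """Hypothetical function to be extremized."""
    return x[0] ** 2 + 2.0 * x[1] ** 2 + 3.0 * x[2] ** 2

def psi(x):
    """Hypothetical single equation of constraint, psi = 0."""
    return x[0] + x[1] + x[2] - 1.0

def determinants(x):
    """Second-order Jacobians J(f, psi / x_1, x_r) for r = 2, 3."""
    df, dpsi = grad(f, x), grad(psi, x)
    return [df[0] * dpsi[r] - df[r] * dpsi[0] for r in (1, 2)]

# x2 = x1/2 and x3 = x1/3 combined with the constraint give x1 = 6/11.
x_opt = [6.0 / 11.0, 3.0 / 11.0, 2.0 / 11.0]
residuals = determinants(x_opt)
```

No multiplier is ever computed; the two determinants and the constraint are enough to fix the three variables. Exchanging the rows for `f` and `psi` only changes the sign of each determinant, so the same relations among the variables hold if the roles of the two functions are interchanged.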

It is obvious that if one or more of the equations of constraint
can be written in the form

$$\psi_j(x_1, x_2, \ldots, x_n) = \text{const};$$

the preceding work remains unchanged. It is further obvious
that the interchange of any columns within any of the determinants
denoted by Equations [10] does not change the value of the
determinant. Thus it is clear that the role of the function $f$ can
be interchanged with that of any one of the constraints which can
be written as above without affecting the final result, providing,
of course, that a set of $m$ $x$'s satisfying the Inequality [8] can be
found in each case. In other words, given a set of $m + 1$ independent
measures of performance which are functions of a larger set
of design variables which may be optimized, then regardless of
which one of the performance criteria is to be extremized (the
other $m$ being fixed), the optimal relations among the design
variables are the same. This important result, which will be
exemplified in the following, is not so apparent from the conventional
approach.
APPLICATIONS OF THE METHOD 


The Lagrangian method will be exemplified by the solution of 
three problems relating to the design of gas-turbine heat exchang- 
ers. In each case, the exchanger is presumed to be of the cross- 
flow plate-fin type, Fig. 1, and the thermodynamic process of 
which it is a part is shown in Fig. 2. The first problem, solved in 
somewhat different form by Aronson (2), is to find the optimal 
division between free-flow areas and NTU on the two sides of the 
exchanger for a prescribed over-all effectiveness and cycle pressure 
ratio. The second problem will lift the prescription of the over-all 
effectiveness, and the third problem will lift the prescription of the 
cycle pressure ratio as well. 


OPTIMAL EXCHANGER PROPORTIONS FOR PRESCRIBED EFFECTIVENESS AND CYCLE PRESSURE RATIO


It is well known (2) that the irreversibilities caused by pressure
drops in the gas-turbine process in Fig. 2 can be expressed by the
sum

$$\left(\frac{\Delta p}{p}\right)_T = \left(\frac{\Delta p}{p}\right)_a + \left(\frac{\Delta p}{p}\right)_g + \left(\frac{\Delta p}{p}\right)_c \qquad [11]$$

where the term on the left is proportional to the total irreversibility
occasioned by the pressure drops and the terms on the right
are the fractional drops in the air side and gas side of the
exchanger and in the combustor, respectively. For the air side of
the regenerator, one may write

$$\left(\frac{\Delta p}{p}\right)_a = \frac{Pr^{2/3}\, W^2 v_a\, \mathrm{NTU}_a}{n\, g\, p_2\, S_a^2} \qquad [12]$$

where $v_a$ is the mean specific volume on the air side, $S_a = cXZ$ is
the free-flow area for the air, and $n = (2j/f)$ relates the heat-transfer
and friction characteristics of the exchanger surface. Following
Aronson (2), it will be assumed that variations in $n$ (which
depend on Reynolds number) may be ignored as a second-order
effect. It will be further assumed, but only for convenience in
symbolism, that $n$ is the same on both sides of the exchanger and
it has been tacitly assumed in Equation [11] that the mass rate of
flow $W$ is the same for both air and gas.

Fig. 2 Regenerative Gas-Turbine Cycle
(Path 1-2, compressor process; 2-3, regenerator air side; 3-4, combustor;
4-5, turbine; 5-6, regenerator gas side.)

Using the $p$-$v$-$T$ relation for perfect gases, Equation [12] may
be rewritten in the form

$$\left(\frac{\Delta p}{p}\right)_a = \frac{k\,Pr^{2/3}}{n}\,\frac{M_a^2}{c^2}\,\mathrm{NTU}_a \qquad [13]$$

where $k$ is the isentropic exponent and $M_a$ is the Mach number
based on the total frontal area $XZ$ and the mean temperature on
the air side. Specification of $M_a$ and the analogously defined $M_g$
specifies in a convenient manner the two frontal areas. These
Mach numbers will be quite low, in the neighborhood of 0.01 to
0.05, for most plant types. The choice of specific values would
depend upon cost factors beyond the scope of the present discus- 
sion. 
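
The step from the flow-rate form to the Mach-number form can be checked numerically. The Python sketch below evaluates the air-side fractional pressure drop both ways, using the reading of Equations [12] and [13] that $(\Delta p/p)_a = Pr^{2/3} W^2 v_a \mathrm{NTU}_a/(n g p_2 S_a^2) = (k Pr^{2/3}/n)(M_a^2/c^2)\mathrm{NTU}_a$. All numerical inputs are made-up illustrative values, the function names are hypothetical, and per-second units are used for $g$ and $W$ instead of the per-hour units of the Nomenclature.

```python
import math

G_C = 32.174  # lbm ft / (lbf s^2), per-second form of g

def dp_over_p_flow_form(w, v, p, s_free, ntu, n, prandtl):
    """Flow-rate form: Pr^(2/3) W^2 v NTU / (n g p S^2)."""
    return prandtl ** (2.0 / 3.0) * w ** 2 * v * ntu / (n * G_C * p * s_free ** 2)

def dp_over_p_mach_form(mach, c, ntu, n, prandtl, k):
    """Mach-number form: (k Pr^(2/3) / n) (M^2 / c^2) NTU."""
    return k * prandtl ** (2.0 / 3.0) * mach ** 2 * ntu / (n * c ** 2)

# Hypothetical operating point (illustrative assumptions only)
k, prandtl, n, ntu = 1.4, 0.7, 0.5, 8.0
w, v, p = 100.0, 6.0, 8000.0   # lb/s, cu ft/lb, psf
frontal, c = 25.0, 0.5         # frontal area XZ in sq ft, free-flow fraction

velocity = w * v / frontal                # ft/s, based on the frontal area
sound_speed = math.sqrt(k * G_C * p * v)  # perfect gas: a^2 = k g p v
mach = velocity / sound_speed

flow_form = dp_over_p_flow_form(w, v, p, c * frontal, ntu, n, prandtl)
mach_form = dp_over_p_mach_form(mach, c, ntu, n, prandtl, k)
```

The two results agree to rounding error, and the frontal-area Mach number for these inputs falls inside the 0.01 to 0.05 range mentioned in the text.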

For the present problem, therefore, we wish to minimize the
function

$$\left(\frac{\Delta p}{p}\right)_T = \frac{k\,Pr^{2/3}}{n}\left[\frac{M_a^2\,\mathrm{NTU}_a}{c^2} + \frac{M_g^2\,\mathrm{NTU}_g}{(1 - c)^2}\right] + \left(\frac{\Delta p}{p}\right)_c \qquad [14]$$

subject to the constraint that a specified amount of heat shall be
transferred. For the case of equal heat-capacity rates on the two
sides, this constraint can be expressed as

$$\psi_1 = \frac{1}{\mathrm{NTU}_a} + \frac{1}{\mathrm{NTU}_g} - F\left(\frac{1 - \epsilon}{\epsilon}\right) = 0 \qquad [15]$$

where $F$, the ratio of the actual mean temperature difference to
that which would obtain in counterflow, is itself a function of the
effectiveness $\epsilon$ and the specific flow arrangement (8). The
function $\psi_1$ is thus a measure of the over-all thermal resistance of the
exchanger.

TABLE 1 MATRIX FOR THE OPTIMIZATION OF REGENERATOR PROPORTIONS


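The partial derivatives to be used in the solution of the problem
are arranged in matrix form in Table 1. The determinants
which are to be set equal to zero will be formed from the rows and
columns of this matrix. It will be seen that either $\mathrm{NTU}_a$ or
$\mathrm{NTU}_g$ may be chosen as the variable satisfying Inequality [8].
The simultaneous solution of the equations

$$J\!\left(\frac{(\Delta p/p)_T,\ \psi_1}{\mathrm{NTU}_a,\ \mathrm{NTU}_g}\right) = 0 \quad \text{and} \quad J\!\left(\frac{(\Delta p/p)_T,\ \psi_1}{\mathrm{NTU}_a,\ c}\right) = 0$$

yields the result that at the optimum

$$\frac{\mathrm{NTU}_a}{\mathrm{NTU}_g} = \sqrt{M_g/M_a} \qquad [16]$$

$$c = 1/(1 + \sqrt{M_g/M_a}) \qquad [17]$$

Combining these results with Equations [14] and [15], we further
find that at the optimum design

$$\mathrm{NTU}_a = \frac{\epsilon}{F(1 - \epsilon)}\left[1 + \sqrt{M_g/M_a}\right] \qquad [18]$$

$$\mathrm{NTU}_g = \frac{\epsilon}{F(1 - \epsilon)}\left[1 + \sqrt{M_a/M_g}\right] \qquad [19]$$

and

$$\left(\frac{\Delta p}{p}\right)_T = \frac{k\,Pr^{2/3}}{n}\,\frac{\epsilon}{F(1 - \epsilon)}\left[\sqrt{M_a} + \sqrt{M_g}\right]^4 + \left(\frac{\Delta p}{p}\right)_c \qquad [20]$$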

As an example of the interchangeability of the constraint with
the function to be extremized, it is to be noted that the identical
results are obtained whether the problem be to minimize the
over-all $\Delta p/p$ for a prescribed over-all thermal resistance, or to
minimize the thermal resistance for a specified over-all $\Delta p/p$.

The specification of the two frontal area Mach numbers, to- 
gether with the optimal Relations [17], [18], and [19], does not 
completely fix the design of the two sides of the exchanger. Given 
the heat-transfer and friction characteristics of the surfaces as 
functions of Reynolds number, we are yet free to choose any one 
of the following dimensions: $X$, $Y$, $Z$, $d_a$, or $d_g$. It is easily shown
that if one of the hydraulic diameters be chosen for specification, 
then the over-all volume decreases continuously as that diameter 
is made smaller while the two frontal areas, which are kept con- 
stant, become more and more elongated in the Z (nonflow) 
direction. This can, of course, lead to awkward ducting even 
though the volume of the exchanger itself can become relatively 
small. The completion of this problem, consequently, depends on 
an interrelation of cost factors and plant layout problems which, 
in turn, depend on the nature of the specific application. This has 
previously been pointed out in another context by Aronson (9). 
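
The optimal proportions of this first problem can be cross-checked by brute force. The Python sketch below minimizes the bracketed part of the pressure-drop function, $M_a^2\,\mathrm{NTU}_a/c^2 + M_g^2\,\mathrm{NTU}_g/(1-c)^2$, subject to $1/\mathrm{NTU}_a + 1/\mathrm{NTU}_g = R$, using a nested golden-section search, and compares the result with the closed forms $c = 1/(1 + \sqrt{M_g/M_a})$ and $\mathrm{NTU}_a/\mathrm{NTU}_g = \sqrt{M_g/M_a}$. The Mach numbers, the value of $R = F(1-\epsilon)/\epsilon$, the search method, and every name are illustrative assumptions; the paper itself uses the determinantal equations, not a numerical search.

```python
import math

def golden_min(func, lo, hi, tol=1e-10):
    """Golden-section search for the minimizer of a unimodal func on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    x1 = b - inv_phi * (b - a)
    x2 = a + inv_phi * (b - a)
    f1, f2 = func(x1), func(x2)
    while b - a > tol:
        if f1 < f2:
            b, x2, f2 = x2, x1, f1
            x1 = b - inv_phi * (b - a)
            f1 = func(x1)
        else:
            a, x1, f1 = x1, x2, f2
            x2 = a + inv_phi * (b - a)
            f2 = func(x2)
    return 0.5 * (a + b)

M_A, M_G = 0.02, 0.04  # hypothetical frontal-area Mach numbers
R = 0.25               # hypothetical thermal resistance, F(1 - eps)/eps

def loss(c, ntu_a):
    """Bracketed pressure-drop term, with NTU_g eliminated by the constraint."""
    ntu_g = 1.0 / (R - 1.0 / ntu_a)
    return M_A ** 2 * ntu_a / c ** 2 + M_G ** 2 * ntu_g / (1.0 - c) ** 2

def best_ntu_a(c):
    """Inner search: best NTU_a for a given free-flow fraction c."""
    return golden_min(lambda n: loss(c, n), (1.0 + 1e-9) / R, 1.0e3)

c_num = golden_min(lambda c: loss(c, best_ntu_a(c)), 0.01, 0.99, tol=1e-8)
ntu_a_num = best_ntu_a(c_num)
ntu_g_num = 1.0 / (R - 1.0 / ntu_a_num)

c_closed = 1.0 / (1.0 + math.sqrt(M_G / M_A))
ratio_closed = math.sqrt(M_G / M_A)
loss_closed = (math.sqrt(M_A) + math.sqrt(M_G)) ** 4 / R
```

The search treats $c$ and $\mathrm{NTU}_a$ as the free variables and recovers the same split that the closed forms give, including the minimum value $[\sqrt{M_a} + \sqrt{M_g}]^4/R$.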




EFFECTIVENESS FOR PRESCRIBED PRESSURE 


In this second problem, we shall determine the regenerator 
effectiveness (as well as its proportions) which maximizes the gas- 
turbine-cycle thermal efficiency for prescribed values of the over- 
all pressure ratio and over-all temperature ratio. Our first task 
will be to write the thermal efficiency in a form amenable to the 
analysis which follows. 

Under the assumption of constant specific heat, the cycle thermal
efficiency (by reference to Fig. 2) is given by

$$E_t = \frac{(T_4 - T_5) - (T_2 - T_1)}{T_4 - T_3}$$

which, upon combination with the regenerator effectiveness

$$\epsilon = (T_3 - T_2)/(T_5 - T_2)$$

is written as

$$E_t = \frac{(T_4 - T_5) - (T_2 - T_1)}{(T_4 - T_5) + (1 - \epsilon)\,[(T_4 - T_1) - (T_4 - T_5) - (T_2 - T_1)]} \qquad [21]$$

The temperature rise in the compressor is

$$T_2 - T_1 = T_1(\chi - 1)/\eta_c \qquad [22]$$

where $\chi = (p_2/p_1)^{(k-1)/k}$ and $\eta_c$ is the compressor adiabatic
efficiency. The temperature drop in the turbine is

$$T_4 - T_5 = \eta_T T_4 \left(1 - \frac{1}{\chi_T}\right)$$

where $\eta_T$ is the turbine adiabatic efficiency and

$$\frac{1}{\chi_T} = \frac{1}{\chi}\left[\frac{1 + (\Delta p/p)_g}{1 - (\Delta p/p)_a - (\Delta p/p)_c}\right]^{(k-1)/k}$$

The fractional pressure drops given in the last expression are quite
small compared with unity, so to within the first order in $(\Delta p/p)$

$$T_4 - T_5 = \eta_T T_4 \left[1 - \frac{1}{\chi}\left(1 + \frac{k - 1}{k}\left(\frac{\Delta p}{p}\right)_T\right)\right] \qquad [23]$$

where $(\Delta p/p)_T$ is the sum of the three fractional pressure drops.
Inserting Equations [22] and [23] in [21] and again neglecting
terms of order $(\Delta p/p)^2$ and higher, we arrive at


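[Equation [24], giving $1/E_t$ in terms of $\epsilon$, $(\Delta p/p)_T$, and the
coefficients $c_1$ through $c_5$, and Equations [25] through [29], defining
those coefficients as functions of $\chi$, $\tau$, $\eta_T$, and $\eta_c$, are not
legible in the source.]

Here $\tau = T_4/T_1$ is the over-all temperature ratio. Equation [24] is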
the function to be minimized. 
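
The effect of the first-order treatment of the pressure drops can be gauged numerically. The Python sketch below evaluates the cycle efficiency from the temperature relations of this section, once with the pressure-ratio expression for the turbine and once with its first-order expansion. The pressure model $p_4/p_5 = (p_2/p_1)\,[1 - (\Delta p/p)_a - (\Delta p/p)_c]/[1 + (\Delta p/p)_g]$, the numerical inputs, and the function name are assumptions made for illustration; temperatures are expressed in units of $T_1$.

```python
def cycle_efficiency(chi, tau, eta_c, eta_t, eps, dp_a, dp_g, dp_c,
                     k=1.4, first_order=False):
    """Thermal efficiency of the regenerative cycle, temperatures in units of T1."""
    m = (k - 1.0) / k
    if first_order:
        # Expansion to first order in the total fractional pressure drop
        inv_chi_t = (1.0 + m * (dp_a + dp_g + dp_c)) / chi
    else:
        inv_chi_t = ((1.0 + dp_g) / (1.0 - dp_a - dp_c)) ** m / chi
    comp_rise = (chi - 1.0) / eta_c              # (T2 - T1)/T1
    turb_drop = eta_t * tau * (1.0 - inv_chi_t)  # (T4 - T5)/T1
    heat_in = turb_drop + (1.0 - eps) * ((tau - 1.0) - turb_drop - comp_rise)
    return (turb_drop - comp_rise) / heat_in

# Hypothetical operating point
args = dict(chi=1.5, tau=4.0, eta_c=0.85, eta_t=0.85, eps=0.75,
            dp_a=0.02, dp_g=0.02, dp_c=0.02)
e_exact = cycle_efficiency(**args)
e_first = cycle_efficiency(first_order=True, **args)
e_ideal = cycle_efficiency(1.5, 4.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0)
```

For pressure drops of a few per cent the two efficiencies differ only in the fourth decimal place, and with ideal components, no regenerator, and no pressure drops the function returns the simple-cycle value $1 - 1/\chi$.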

The relation between $\epsilon$ and $(\Delta p/p)_T$ is given by the equations
of constraint. The function which was minimized in the preceding
problem becomes one of the constraints in the present problem,
namely

$$\psi_1 = \frac{k\,Pr^{2/3}}{n}\left[\frac{M_a^2\,\mathrm{NTU}_a}{c^2} + \frac{M_g^2\,\mathrm{NTU}_g}{(1 - c)^2}\right] + \left(\frac{\Delta p}{p}\right)_c - \left(\frac{\Delta p}{p}\right)_T = 0 \qquad [30]$$

while

$$\psi_2 = \frac{1}{\mathrm{NTU}_a} + \frac{1}{\mathrm{NTU}_g} - F\left(\frac{1 - \epsilon}{\epsilon}\right) = 0 \qquad [31]$$

Since $F$ is a function of $\epsilon$, which is now no longer fixed, we must
include a relation between them. This relation depends on
the specific flow arrangement and upon the degree of mixing in the
directions normal to their flow for each of the two fluids. For
the purpose of the example we choose a single-pass cross-flow
exchanger in which one of the fluids is assumed to be mixed, the
other unmixed. From Equation [11] of reference (8), the relation
for this case can be expressed (when the two heat-capacity rates
are equal) by

$$\psi_3 = F + \frac{\epsilon}{(1 - \epsilon)\,\ln\,[1 + \ln\,(1 - \epsilon)]} = 0 \qquad [32]$$

TABLE 2 MATRIX FOR THE OPTIMIZATION OF EFFECTIVENESS AND CYCLE PRESSURE RATIO


Equations [24], [30], [31], and [32] complete the statement of 

this second problem. Table 2 shows the matrix of partial deriva- 

tives appropriate to the method used herein. It is evident that 

one set of variables satisfying Inequality [8] is F, (Ap/p)7, NTU,. 
Simultaneous solution of the equations 


[ 1/E,, Ya, Ws | 
P, (Ap/p)r, NTU,, NTU, 


and 


F, (Ap/p)r, NTU,, 


yields results duplicating those of the preceding problems as given
by Equations [16] through [20]. This indicates that, under the
assumptions made here, the optimal proportions of the regenerator
depend on ε in a manner that is independent of whether ε has
itself been optimized or not. The optimal ε is given by the solution
of the fourth determinantal equation


F, (Ap/p)r, NTU,, € 


which can be written as 


E Vy Vs 


14a 
5 


A 
[33] 
where 
y = ln z = ln(1 − ε)
and 
= [/M, + VM, ]*........... [34] 


Note: The first six columns apply to the second problem in the text; the entire matrix applies to the third problem.

Equation [33] cannot be solved explicitly for y (or ε) but the
form given here is particularly suitable for iterative solution. An
assumed value of y inserted on the right yields an improved value
on the left and the process is repeated to convergence. From
Equation [32] it can be shown that as F varies from zero to unity,
ε covers the range (e − 1)/e to zero; hence y has the limits −1
and zero. The iteration indicated by Equation [33] converges
rapidly regardless of which end of the range is chosen for the first
step in the process.
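The iteration described for Equation [33] is a plain successive substitution, y ← g(y). The sketch below shows only the mechanics. The right-hand side `g_demo` is a hypothetical stand-in chosen so that its fixed point lies between −1 and zero; it is not Equation [33], and the function names are illustrative.

```python
import math


def successive_substitution(g, y0, tol=1e-12, max_iter=200):
    """Repeat y <- g(y) until two successive values agree within tol."""
    y = y0
    for n in range(1, max_iter + 1):
        y_new = g(y)
        if abs(y_new - y) < tol:
            return y_new, n
        y = y_new
    raise RuntimeError("iteration did not converge")


def g_demo(y):
    # Hypothetical right-hand side (NOT Equation [33] of the paper).
    return -0.5 * math.exp(y)


# Start once from each end of the admissible range of y.
root_from_lower, steps_lower = successive_substitution(g_demo, -1.0)
root_from_upper, steps_upper = successive_substitution(g_demo, 0.0)
```

For this stand-in both starting values lead to the same fixed point, which mirrors the remark that either end of the range may be used for the first step.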

Having found the optimal value of ε from Equation [33], the
optimal (Δp/p)_T can be evaluated by combining Equations [20]
and [32]; it is

(Ap/p)r = (Ap/p), —T In [1 + In (1 —e)]..... [35] 


The fractional pressure drop in the combustor has of course been 
assumed to be a prescribed constant. 


PRESSURE RATIO


If we now set the cycle pressure ratio free and wish to find its
optimal value (for prescribed over-all temperature ratio τ), we
add a fourth degree of freedom to the system described in the
previous problem. Examination of Table 2 shows that the preceding
work remains applicable to the present case; the optimal proportions
of the regenerator are still given by Equations [16] through
[20] and the optimal effectiveness by the solution of Equation [33].
Hence the present problem requires the solution of only one new
equation


af 1/E,, ¥:, ] 
FP, (Ap/p)z, x 
Because of the three zero elements in the last column of the matrix 
of Table 2, this equation reduces to 

[ v2, Vs ] &(1/E;) 

F, (Ap/p)r, NTU, ox 

The first term of this product has already been shown to be non- 
zero, hence this third problem is solved by setting 


(1/E;) 
ox 


=0 


=0 


Performing the indicated differentiation on Equation [24] and 
using Equations [25] through [29], we arrive at an equation of 
the fourth degree in the optimal value of x. The equation can be 
solved, however, by a rapidly converging iteration in the following 
form 




[ryr(2e — 1) + (r — — €) — — 1)x 
+ — 1) — nr — 1X1 — 


k p/r — X 


(1 — — 1) [ — 1) 
x—1 — X 


The procedure is as follows: A reasonable value of x is assumed
and the corresponding optimal ε and (Δp/p)_T are computed by
the equations given in the first two problems. With these
substituted in Equation [36], together with the initial assumption of
x where it appears on the right of that equation, the resulting
quadratic is readily solved for an improved value of x and the
process is repeated. Subsequent solutions of Equation [36] have
been shown by trial to yield oscillating values of decreasing
amplitude for x; hence convergence can be hastened by judicious
choices of the value of x for the successive steps of the iteration.
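One simple way to make such "judicious choices" when the iterates oscillate about the answer is to take a weighted mean of the old and new values (under-relaxation). The sketch below is an assumption about how this might be done, not the authors' procedure; `g_demo` is a hypothetical function whose iterates oscillate with decreasing amplitude and it is not Equation [36].

```python
def iterate(g, x0, relax=1.0, tol=1e-10, max_iter=500):
    """Fixed-point iteration; relax < 1 blends the old and new values."""
    x = x0
    for n in range(1, max_iter + 1):
        x_new = (1.0 - relax) * x + relax * g(x)
        if abs(x_new - x) < tol:
            return x_new, n
        x = x_new
    raise RuntimeError("iteration did not converge")


def g_demo(x):
    # Hypothetical stand-in (NOT Equation [36]); successive iterates
    # fall alternately above and below the fixed point.
    return 1.0 + 1.0 / x


x_plain, n_plain = iterate(g_demo, 2.0)              # straight substitution
x_damped, n_damped = iterate(g_demo, 2.0, relax=0.7)  # weighted mean
```

For this stand-in the damped iteration reaches the same value in fewer steps; the best weighting for Equation [36] itself would have to be found by trial.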




SUMMARY


The method of Lagrangian multipliers is presented in a form 
which does not require the explicit solution for the multipliers 
themselves. To optimize a set of n variables which are interre- 
lated by m equations of constraint, it is only necessary instead to 
solve simultaneously a set of n — m determinants of (m + 1)th 
order each set equal to zero, together with the m equations of 
constraint, in order to obtain directly the optimal values of the n 
variables. 
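For the smallest nontrivial case (n = 2 variables, m = 1 constraint) the procedure reduces to one second-order determinant set equal to zero, solved together with the constraint. The sketch below uses a made-up objective f = x² + 2y² and constraint ψ = xy − 1 = 0, chosen only for illustration, and solves the two equations by Newton's method with a finite-difference Jacobian. The functions and names are assumptions and do not appear in the paper.

```python
def grad_f(x, y):
    """Partial derivatives of the illustrative objective f = x**2 + 2*y**2."""
    return 2.0 * x, 4.0 * y


def psi(x, y):
    """Illustrative constraint, psi = x*y - 1 = 0."""
    return x * y - 1.0


def grad_psi(x, y):
    return y, x


def residuals(x, y):
    """Determinant d(f, psi)/d(x, y) and the constraint; both must vanish."""
    fx, fy = grad_f(x, y)
    px, py = grad_psi(x, y)
    return fx * py - fy * px, psi(x, y)


def solve_stationary(x, y, tol=1e-12, max_iter=50, h=1e-6):
    """Newton iteration on the two residuals (central-difference Jacobian)."""
    for _ in range(max_iter):
        r1, r2 = residuals(x, y)
        if abs(r1) + abs(r2) < tol:
            break
        a1, a2 = residuals(x + h, y)
        b1, b2 = residuals(x - h, y)
        c1, c2 = residuals(x, y + h)
        d1, d2 = residuals(x, y - h)
        j11, j21 = (a1 - b1) / (2 * h), (a2 - b2) / (2 * h)
        j12, j22 = (c1 - d1) / (2 * h), (c2 - d2) / (2 * h)
        det_j = j11 * j22 - j12 * j21
        # Cramer's rule for J * (dx, dy) = -(r1, r2).
        x += (-r1 * j22 + r2 * j12) / det_j
        y += (-j11 * r2 + j21 * r1) / det_j
    return x, y
```

No multiplier is ever computed: the determinant condition replaces it, which is the feature the Summary emphasizes. For this example the determinant is 2x² − 4y², so the stationary point satisfies x = √2·y on the curve xy = 1.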

Applied to problems of process and plant design, it is believed 
that this method can afford more insight in the synthesis of the 
mathematical statement of the problem and can be simpler in the 
execution of the solution than more widely used methods. In 
particular, it lends itself more readily to numerical evaluation of 
results where the equations of constraint are transcendental in 
form. 

In the paper, the method is applied to the problem of simul- 
taneously optimizing the cycle pressure ratio, and the regenerator 
effectiveness, pressure drops, and proportions for maximum ther- 
mal efficiency in a gas-turbine cycle of specified turbine and com- 
pressor-inlet temperatures. It becomes clear that the optimization 
of the regenerator proportions is independent of the optimiza- 
tion of the regenerator effectiveness and that both are independ- 
ent of the optimization of the cycle pressure ratio. 

While a particular function, Equation [32], was chosen to relate
the effectiveness to the mean temperature difference, any other
applicable relationship would work as well and such a change
would involve only the last row of the matrix in Table 2. Similarly,
if it were required that some function other than thermal
efficiency be extremized (e.g., cost or weight), the change would
involve only the first row of the matrix. It is believed that this
localization of changes in the statement of such problems is a
particular merit of the method described herein.


BIBLIOGRAPHY 


1 “Heat Transmission,” by W. H. McAdams, McGraw-Hill Book
Company, Inc., New York, N. Y., third edition, 1954, chapter 15.

2 “Design of Regenerators for Gas-Turbine Service,” by D.
Aronson, Trans. ASME, vol. 72, 1950, pp. 967-978.

3 “Optimum Design of Gas Turbine Regenerators,” by W. M.
Rohsenow, T. R. Yoos, Jr., and J. F. Brady, ASME Paper No. 50-A-103.

4 “Review of Optimum Design of Gas-Turbine Regenerators,”
by D. Aronson, Trans. ASME, vol. 74, 1952, pp. 675-683.

5 “Differential and Integral Calculus,” by R. Courant, Interscience
Publishing Company, New York, N. Y., vol. 2, 1952.

6 “Advanced Calculus,” by W. Kaplan, Addison-Wesley Press,
Inc., Cambridge, Mass., 1952, pp. 128-136.

7 “Heat and Thermodynamics,” by M. W. Zemansky, McGraw-Hill
Book Company, Inc., New York, N. Y., third edition, 1951, pp.
428-429.

8 “Mean Temperature Difference in Design,” by R. A. Bowman,
A. C. Mueller, and W. M. Nagle, Trans. ASME, vol. 62, 1940, pp.
283-294.

9 Discussion by D. Aronson of “Heat-Transfer and Flow-Friction
Characteristics of Some Compact Heat-Exchanger Surfaces; Part 2 -
Design Data for Thirteen Surfaces,” by W. M. Kays and A. L. London,
Trans. ASME, vol. 72, 1950, p. 1096.


Discussion 


David Aronson.⁵ The authors take a fresh approach to the
problem of optimal heat-exchanger design. The writer appreci- 
ates the simplicity of treatment which the authors achieve by the 
use of Lagrangian multipliers and the solution of the equations in 
an ordered fashion by means of the Jacobian operator. The 
authors are to be commended for their clear presentation of the 
design parameters to be optimized: 


1 Optimal exchanger proportions as regards balance between 
hot and cold-passage design. 

2 Optimal effectiveness for prescribed pressure ratio. 

3 Optimal pressure ratio. 


The authors’ results for the analysis of the first topic are in 
agreement with the ones the writer obtained (reference 2 of the 
paper) and he is grateful for this independent check of his pre- 
vious work. The solution of the second topic appears to be 
correct as regards the formal mathematics, but seems in error in 
the selection of design relations. In the opinion of the writer, 
the third topic is not amenable to analytical treatment in the 
simple fashion chosen by the authors. 

The writer recently submitted to a Review Committee of the 
ASME a possible solution to the second topic, in the nature of an 
approximation. He, therefore, was looking with some eagerness 
to the solution the authors propose, since they apparently en- 
deavored to avoid approximations in their development. How- 
ever, he was disappointed to find that they had glossed over the 
same troublesome spot as he had. The problem can be stated in 
these terms: The pressure ratio of a gas-turbine cycle has been 
set, say, by the turbine designer. Limitations also are imposed 
on the design of the heat exchanger. One can place a limit on 
the heat-exchanger volume, cross-sectional area for flow, or the 
total heat-transfer-surface area. The authors do not state clearly 
which limitation they have chosen. The treatment of the prob- 
lem, however, indicates that they have specified a fixed cross- 
sectional area for flow. One must look at this restraint more 
closely. The correct statement of the condition should read: 
“Cross-sectional area for flow is fixed for a given net work.” The
authors set up the problem on the basis of: “Cross-sectional area
for flow is fixed for a given working-fluid flow rate.” Had the
authors chosen to limit heat-transfer-surface area, the distinction 
between basing this limit on net work or on working-fluid flow 
rate would have been much less significant, because the optimal 
pressure drop would be about one fourth that for the limiting 
condition studied in the paper. 

5 Consulting Engineer, Worthington Corporation, Harrison, N. J.
Mem. ASME.

Fig. 3  Y* for multipass cross-counterflow exchangers, C_min/C_max = 1.0,
where Y* = [dε/d(NTU) for crossflow]/[dε/d(NTU) for true counterflow];
abscissa NTU = UA/C_min. One panel is for the case of one fluid mixed in
pass, one unmixed, mixed between passes; the other is for the case of both
fluids unmixed in passes, mixed between passes.

The authors’ final equation is stated to be suitable for iterative
solution; it cannot be solved explicitly. The iterative solution
may or may not be possible for the general case of any type of
heat-exchanger arrangement, counterflow, multipass crossflow, or
single-pass crossflow. The condition chosen by the authors
seems to the writer to be a strange one. They use the equation
([32] of the paper) for a single-pass, crossflow exchanger with
fluid mixed on one side, unmixed on the other. For this case the
maximum effectiveness, regardless of heat-exchanger size, is only
about 63 per cent. This is such a low effectiveness that it is hard
to consider it in a study of the optimum. The heat exchanger
illustrated in Fig. 1 of the paper would seem to approach in its
performance more closely to the case of fluids unmixed on both
sides, for which case the effectiveness can approach 100 per cent
as the surface area is increased without limit.
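The figure of about 63 per cent can be checked from the standard effectiveness relation for a single-pass crossflow exchanger with one fluid mixed and equal heat-capacity rates. That relation is taken here as an assumption, since the discussion does not write it out:

```latex
\varepsilon = 1 - \exp\!\left[-\left(1 - e^{-\mathrm{NTU}}\right)\right],
\qquad
\lim_{\mathrm{NTU}\to\infty} \varepsilon = 1 - e^{-1} = \frac{e-1}{e} \approx 0.632 .
```

This is the same upper limit, (e − 1)/e, that the paper gives for the range of ε.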

The writer arrived at the following solution of the problem as 
set forth by the authors 


arin, + = )( (1 + NTU)? 


where Y* is a correction term to allow for the difference in slope of
the ε versus NTU curve for the particular flow arrangement as
compared with the counterflow arrangement for Cmin/Cmax =
1.00. For the crossflow arrangement used by the authors, Y*
would have a value of 0.51 at NTU = 2.0 and a value of 0.18 at
NTU = 4.0.

The term NTU represents the over-all NTU of both sets of 
passages, as defined by 


1/NTU = 1/NTU, + 1/NTU, 


for the special case of Cmin/Cmax = 1.00. 

This indicated optimum pressure-drop ratio is in error in the 
same fashion as would be the answer obtained by the authors in 
that no allowance is made for the reduction in net work due to the 
pressure loss in the heat exchanger. 

Fig. 4  Value of Y* for several single-pass cross-flow arrangements,
where Y* = [dε/d(NTU) for particular flow arrangement]/[dε/d(NTU) for
counterflow with C_min/C_max = 1.00]; abscissa NTU = UA/C_min. Curves:
C_min/C_max = 0.75, both fluids unmixed; C_min/C_max = 1.00, both fluids
unmixed. For counterflow with C_min/C_max = 0.75, Y* = 1.14
(approximately over entire range of NTU).

For the case in which the heat-transfer-surface area is specified,
the total pressure-drop ratio given by Equation [37] herewith
would be multiplied by a term r dependent on fin
effectiveness and on the value of the exponents for the change of
friction factor f and heat-transfer factor j with respect to Reynolds
number. The term r has a value in the order of 0.25 for turbulent
flow, with a slight increase or decrease in this value dependent on
fin effectiveness and whether the flow is along smooth surfaces
or across interrupted surfaces.

The third topic considered is even more vulnerable to the criti- 
cism that it does not hold to a fixed net work. Presumably one 
could modify the equations to adjust for the decrease in net 
work per unit of working-fluid flow. Such an analysis would have 
no practical interest since the change in plant cost with change in 
pressure ratio and, hence, of flow volumes would not be given 
consideration. The selection of optimum-pressure ratio is tied in 
with the design of compressor, turbine, combustion chamber, and 
ducts as well as with the design of the heat exchanger. Perhaps 
the techniques described by the authors could be applied to such 
an analysis. This would call for inclusion of capital costs, fuel 
costs, interest and amortization schedules, and so on. The dis- 
tinction between this latter problem and the previous ones lies in 
the nature of the choices. Topics one and two can be treated 
purely on the basis of the thermodynamic consideration of the 
irreversibilities in the heat exchangers themselves. Topic three is 
primarily concerned with the optimum capital investment in the 
plant. The familiarity with the solutions indicated by proper 
treatment of topics one and two can aid materially in the solution 
of the problem posed by topic three. The authors have contributed
constructively on the first two topics. It is felt that they
would enhance the value of their paper if they were to illustrate
how their proposed solutions can be applied to the study implied
in topic three, but not proceed with the solution. Topic three
offers a challenge justifying another paper or several papers.


Walter Daskin.⁶ The authors are to be congratulated for
their lucid exposition of a technique which, while well known to
mathematicians and engineers dealing with problems in solid
mechanics, seems to have been virtually ignored by those of us in
heat-transfer and fluid mechanics. In particular, the clarity with
which the interrelationships among the various extremal problems
have been demonstrated is one of the most important and
far-reaching points of this paper.

The writer, having been aware of this particular work since
1954, has applied the technique to a number of cycle-optimization
problems which would have been extremely tedious to solve by
the more conventional methods.

A difficulty which often arises in connection with extremal 
problems involving several variables is that of establishing 
whether a true extremum (rather than a saddle-point) has been 
found, and whether the extremum is a maximum or a minimum. 
Equation [3] of the paper is a necessary, but not a sufficient con- 
dition, for the extremum. The writer would like to know whether 
the authors are aware of any relatively simple methods of in- 
vestigating the character of the stationary point. It has been the 
writer’s experience that it is often easiest actually to calculate 
values of the function in the neighborhood of the point in ques- 
tion when the number of independent variables exceeds two—a 
tedious procedure at best. 

Two rather minor points in the paper deserve comment here. 
The first is that in many applications the pressure drops associated
with the sudden contraction and expansion of the fluids as
they enter and leave the heat exchanger may be significant and 
these quantities should be included in the analysis. The second 
point is that the fractional pressure drop in the combustor may 
often be expressed as a function of 7, — 7; and thus included in 
the analysis with little additional effort. 

The paper clearly points to the need for systematic data on the
performance of heat-exchanger surfaces. Despite the monumental
effort by Kays and London⁷ we are still in the state where we
must often go through a great number of calculations, each
assuming different types of surface, in order to find what particular
surface would result in the best heat-exchanger design. While
the present paper shows how to find the optimum configuration
once a surface is chosen, it is still often necessary to perform the
calculations for a number of surfaces in order to arrive at the best
configuration. An example of this sort of calculation (in which
the optimization technique which was used was the conventional
one) is given.⁸ It would be extremely pleasant for the designer
to have some sort of analytic function describing the variation
in heat-transfer and friction-drop performance when going from
one kind of surface to another.

AUTHORS’ CLOSURE

The authors thank Mr. Aronson and Mr. Daskin for their 
careful reviews of the paper and for the questions which they 
raise. 

6 Heat Transfer Analyst, Flight Propulsion Laboratory Department,
General Electric Company, Cincinnati, Ohio. Assoc. Mem. ASME.

7 “Compact Heat Exchangers,” by W. M. Kays and A. L. London,
The National Press, Palo Alto, Calif., 1955.

8 “Selection of Optimum Configurations for a Heat Exchanger With
One Dominating Film Resistance,” by E. R. G. Eckert and T. F.
Irvine, ASME Paper No. 56-S-20.

The prime purposes of the paper were to show that the method
of Lagrangian multipliers can be simplified (by implicit elimination
of the multipliers themselves) and to show that the method
can be applied with profit to practical problems of plant design.
The particular problems chosen for the exemplification of the
method were only secondary; the choice was dictated to some extent
by a desire to cover most of the interesting and useful facets
of the method with a minimum of tedium. Consequently, the
authors take no umbrage at Mr. Aronson’s criticism of their
choice of constraints for the second and third examples demonstrated
in the paper. On the contrary, the question is welcome
because it permits an added demonstration of the flexibility of the
method to accommodate just such changes in constraints.

Mr. Aronson is correct in saying that in the original statement
of the second problem we have assumed that the two frontal areas
per unit fluid-flow rate are fixed by prescribing values for M_a and
M_g. The variables remaining in the problem are six: (Δp/p)_T,
NTU_a, NTU_g, c, ε, and F. These are in turn related by Equations
[30], [31], and [32], so that three of the six variables are
independent. To accommodate Mr. Aronson’s suggestion that
he would rather prescribe the frontal areas per unit net work, one
would set free M_a and M_g and add to the previous three equations
(which remain unchanged) two new equations of constraint.
Thus, the net number of independent variables remains
the same. The two new equations are


Ws = Mle: — Ap/p)z = xne/F V 


where F_a and F_g are the prescribed values of air-side and gas-side
frontal areas per unit net work, respectively. The elements
of the matrix in Table 2 of the paper remain unchanged, except
that the matrix itself becomes enlarged by two additional rows
and two additional columns. As part of the solution, it is quickly
found that Equations [16] through [20] remain the same as before.
The remainder of the work is tedious but not difficult in
essence.

Mr. Aronson is correct in saying that a crossflow exchanger
in which one fluid is mixed and the other unmixed has a very low
maximum effectiveness. Again, the choice was made on the basis
of reasonable simplicity for the purpose of exposition. The exact
relation between the functions F and ε in Equation [32] for the
case of both fluids unmixed (8) is indeed a formidable one for our
present purposes; nevertheless, it could very well be closely
approximated, at least over an appropriate range of interest, by a
less complicated analytical expression. It is worth repeating
that such a change would affect only the third row of the matrix
in Table 2 and none of the other work done to that point.

In answer to Mr. Aronson’s last question, the method is equally 
applicable to the problem of minimizing cost, provided only that 
the problem can be stated in mathematical terms using differentia- 
ble functions. One of the authors has recently solved such a prob- 
lem (in a field other than gas-turbine plants) in which there were 
ten variables related by five equations of constraint. The basic 
question is, of course, the proper balance between fixed and 
variable costs and, as expected, the answers are functions of plant 
capacity factor and rate of amortization as well as unit costs of 
equipment and fuel. 

The authors are particularly pleased that Mr. Daskin has found
this work useful. His question as to how to determine whether
the stationary point is an extremum or a saddle point is well
taken. Where f is a function of n x’s, all of which are independent,
the stationary point is a minimum if the n × n matrix
||∂²f/∂x_i ∂x_j|| is positive definite; i.e., if each of the determinants

$$\left|\frac{\partial^2 f}{\partial x_i\,\partial x_j}\right|_{i,j=1,\ldots,r} > 0$$

for r equal in turn to 1, 2, . . ., n, when evaluated at the critical
point.
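The test for the unconstrained case is the familiar leading-principal-minor criterion, and it is easy to mechanize once the second derivatives at the stationary point are known. The sketch below is illustrative only: the matrices are made-up examples, the names are not from the paper, and the determinant is computed by cofactor expansion, which is adequate only for small n.

```python
def determinant(m):
    """Determinant by cofactor expansion along the first row (small n only)."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0.0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * determinant(minor)
    return total


def leading_minors(hessian):
    """Determinants of the upper-left r x r blocks, r = 1, ..., n."""
    n = len(hessian)
    return [determinant([row[:r] for row in hessian[:r]])
            for r in range(1, n + 1)]


def is_minimum(hessian):
    """True when every leading minor is positive (positive-definite matrix)."""
    return all(d > 0 for d in leading_minors(hessian))


# Made-up matrices of second derivatives at a stationary point.
bowl = [[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]]
saddle = [[1.0, 2.0], [2.0, 1.0]]
```

Here `is_minimum(bowl)` holds because all three minors are positive, while `saddle` fails at the second minor.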

Where the n x’s are not all independent, but related by m constraints,
the situation is far less simple. The best the authors can
offer at present is: Form the n × n matrix


where the m μ’s are arbitrary multipliers, different in general
from the Lagrangian multipliers. Each of the determinants of
order higher than n − m formed from the leading minors of this
matrix can be set separately equal to zero, and from the simultaneous
solution of these equations, the values of the m μ’s can be
found. The remaining matrix of order n − m must be positive
definite for a minimum, rather than a saddle point, to obtain.
This test is admittedly an impractical one unless the original matrix
contains a large fraction of null elements.

The authors heartily concur in Mr. Daskin’s desire to have an
analytic function which relates the performance characteristics of
a variety of heat-transfer surfaces, but we politely decline the
challenge.



Tests of Free Convection in a Partially Enclosed Space Between Two Heated Vertical Plates


The free-convection heat-transfer coefficients were 
measured, as a function of spacing, for two parallel, elec- 
trically heated, vertical plates. The top of the rectangular 
space between the plates was left open, and during most 
of the tests the bottom and sides were closed. The heat 
input per unit area was substantially uniform, and the 
Grashof number, based on plate height, was of the order of 

_10". The results demonstrate how the surface-tempera- 
ture rise increases, or the local Nusselt number decreases, 
as either of the cross-section dimensions of the free- 
convection space is reduced. This decrease in Nusselt 
number is less rapid than was expected from purely two- 
dimensional flow considerations. This is ascribed to the 
effect of an asymmetrical flow pattern observed in the 
space. In some cases, a periodic reversal of this asym- 
metrical flow also was observed. When the space between 
plates was opened sufficiently at the bottom, the results 
were reasonably consistent with the correlation proposed 
by Jakob for a single vertical plate with a turbulent 
boundary layer; these results were almost independent of 
spacing between the heated surfaces down to the minimum
spacing tested. 


NOMENCLATURE 
The following nomenclature is used in the paper: 


b = breadth (width) of heated plate (see Fig. 1), ft
c_p = specific heat at constant pressure, Btu/(lb deg F)
g = acceleration of gravity, ft/hr^2
h = local coefficient of heat transfer = q″/(t_w − t_∞), Btu/(hr ft^2 deg F)
      value of h at x = 0.81L for two heated plates, with space
      closed at bottom and sides, and b = 0.75L, s = 0.28L
k = thermal conductivity, Btu/(hr ft deg F)
L = total height of plate, ft
q″ = heat flux per unit area at wall, Btu/(hr ft^2)
s = spacing between a pair of plates (see Fig. 1), ft
t = temperature, deg F
V = average velocity of horizontal air stream across the top of
      the plates, fpm
x = co-ordinate of height along flat plate, measured from bottom
      edge, ft
z, = distance of bottom edge of plate above the floor, ft 


1 Aeronautical Research Scientist, National Advisory Committee 
for Aeronautics, Cleveland, Ohio; formerly, General Engineering 
Laboratory, General Electric Company, Schenectady, N. Y. Assoc. 
Mem. ASME. 

2 Consulting Engineer, General Engineering Laboratory, General 
Electric Company, Schenectady, N. Y. Mem. ASME. 

Contributed by the Heat Transfer Division and presented at a
joint session with the Gas Turbine Power Division at the Semi-Annual
Meeting, Cleveland, Ohio, June 17-21, 1956, of The American
Society of Mechanical Engineers.

Note: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
31, 1956. Paper No. 56-SA-5.


By ROBERT SIEGEL¹ and R. H. NORRIS²




β = volumetric coefficient of expansion, deg F^-1
δ* = displacement thickness of the boundary layer, ft
μ = coefficient of viscosity, lb/(hr ft)
ν = kinematic viscosity, μ/ρ, sq ft per hr
ρ = density, pcf

Subscripts

∞ = properties of air at ambient or room temperature (see
      Reduction of Data for discussion)
L = location at, or value for, total height of plate
w = location at wall (surface of heated plate)

Dimensionless Groups

N_Gr = local Grashof number, gβ(t_w − t_∞)x^3/ν^2
N_Gr,L = total Grashof number based on total plate height
N_Nu = local Nusselt number, hx/k
N_Pr = Prandtl number, c_p μ/k
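The dimensionless groups defined above translate directly into short helper functions. The sketch below is only a convenience for checking magnitudes; the function and argument names are illustrative, and the caller is assumed to supply values in a consistent set of units (for example g in ft/hr² and ν in sq ft per hr, as in the nomenclature).

```python
def grashof(g, beta, t_wall, t_ambient, x, nu):
    """Local Grashof number, g * beta * (t_w - t_inf) * x**3 / nu**2."""
    return g * beta * (t_wall - t_ambient) * x ** 3 / nu ** 2


def nusselt(h, x, k):
    """Local Nusselt number, h * x / k."""
    return h * x / k


def prandtl(c_p, mu, k):
    """Prandtl number, c_p * mu / k."""
    return c_p * mu / k
```

Passing the total plate height L in place of x gives a Grashof number based on plate height.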


INTRODUCTION 


The main purpose of this experimental investigation was to de- 
termine the free-convection heat transfer as a function of spacing 
between two parallel electrically heated vertical plates having a 
uniform heat input per unit area. The space between the plates 
was partially confined in most of the tests by closing the bottom 
and both sides with thermal insulation as shown in Fig. 1. 


Fig. 1  Schematic Arrangement of Test Surfaces for Most of Tests
(labels: test plates, guard plates, insulation, side insulation, wooden
baffle, flow (if fans used); dimensions b and s)


To the authors’ knowledge, previous systematic data on the
effect of spacing between a pair of heated plates have been available
(1, 2, 3)³ only for the case where the space between the
plates was open at the bottom. A consideration of the free-convection
boundary layers on plates with the space closed at the
bottom led to the expectation that below some critical spacing the
rising boundary layers would interfere with each other and hence
obstruct the supply of cool air from the top of the space. With
further decreases in spacing the heat-transfer coefficient would be
expected to decrease progressively. Owing to the difficulty of
analysis for such a configuration, it was decided to resort to an
experimental program to establish the magnitude of these
geometrical effects.

3 Numbers in parentheses refer to the Bibliography at the end of the
paper.

In addition to this basic configuration, several other geome- 
tries and effects were tested. These included changing the 
widths of the heated surfaces, blowing a stream of air horizontally 
at various velocities across the top of the space, heating only one 
plate with the other insulated, admitting air through openings be- 
tween the bottom edge of the surfaces and the floor, and opening 
the sides of the space. 

The height of the heated surfaces was 70 in., which resulted in 
Grashof numbers of the order of 10”, based on plate height. The 
boundary layers on the plates were therefore expected to be
turbulent except near the bottom, and this was indeed found to be
the case throughout the experiments. 


EXPERIMENTAL EQUIPMENT 


The basic configuration of the test surfaces is shown schemati- 
cally in Fig. 1. To help minimize stray heat losses each of the test 
surfaces was backed by thermal insulation and an independently 
heated guard plate. The heat input to the guard plate was ad- 
justed to keep its average temperature approximately equal to 
the average temperature of the test plate. Six layers of aluminum- 
foil insulation were provided between the test and guard plates to 
reduce to a very low value any heat flow resulting from the un- 
avoidable small local inequalities of temperature between the two 
plates. 

Each heated surface was constructed of three electrically heated 
panels each 70 in. high and 17.5 in. wide. These were fastened 
together by thin vertical aluminum moldings to form a single 
continuous surface about 53 in. wide. The moldings projected 
beyond the surface of the panel about '/; in. The heating panels 
were composed of a layer of bakelite insulation 0.020 in. thick 
covered on each side by a sheet of aluminum 0.005 in. thick. Very 
thin heating wires were imbedded in the central insulation 
layer. The spacing of these heating wires per inch was found to be 
substantially the same, 2.29 per in., for all parts of the panel; the 
wires themselves were not always perfectly straight, however, so 
that the spacing between a particular pair of wires sometimes 
varied as much as +10 per cent from the average. 

Calculations have shown that the edgewise thermal conductance 
of the thin aluminum sheets was low enough so that when con- 
sidering the 70-in-high plate as a whole, the surfaces had sub- 
stantially uniform heat input. Nevertheless, over the small dis- 
tance from one heating wire to the next, the edgewise thermal 
conductance of the aluminum was sufficient to maintain sub- 
stantially uniform temperature of the surface, unaffected by the 
occasional small nonuniformity of the spacing between adjacent 
wires.

During most tests the space for free convection between the 
plates had a horizontal width b of 53 in. parallel to the plates, 
which were likewise 53 in. wide. In some tests, however, two thin 
vertical wooden partitions were inserted to reduce the width of 
the free-convection space. These partitions were arranged sym- 
metrically, so that the center of the test surface of reduced width 
remained at the center of the 53-in-wide heating surface. 

A horizontal air flow was provided across the tops of the 
heated panels in certain of the tests, by three axial-flow fans as 
indicated in Fig. 1. Although the resulting air flow was somewhat 
nonuniform and rather turbulent, it served to permit evaluation 
of the sensitivity of the test-plate temperatures to such a hori- 
zontal air stream, with a minimum expense for provision of the 


TRANSACTIONS OF THE ASME 


flow. The average velocity of this flow was measured with a 
velometer. 

Tests also were performed with horizontal wooden-strip turbu- 
lence promoters attached to the heated plates at vertical inter- 
vals of 6.5 in. These strips projected outward from the plates 
3/,in., and were 5/32 in. thick vertically. 


INSTRUMENTATION 


The total power input to the test panels was determined from 
voltmeter and ammeter readings, with a wattmeter being used as
a check. The electrical load had negligible reactance, and hence
a power factor of unity. 

Temperatures on the test panels were measured by “Stickon”
resistance-thermometer elements (Ruge DeForest Company, type 
BN-1), which were cemented to the plates by a thin coating of 
Armstrong cement. Preliminary tests with elements placed at 
corresponding positions on both the exposed and insulated sides 
of a test panel showed good agreement with each other, and hence 
the elements were thereafter placed only on the insulated side so 
that lead wires would not interfere with the natural convection 
flow. 

Thermometer elements were located at vertical heights above 
the bottom edge of the plate of 3, 7, 12, 21, 36, and 61 in. One 
row of elements was provided in the central portion of the test 
surface. In addition, at the 7, 21, and 61-in. levels, elements were 
also located at points 9 in. horizontally inward from each of the 
outer vertical edges of the panels to provide an indication of 
the horizontal variation of the surface temperature. No attempt 
was made to explore more completely the temperature distribu- 
tion over the entire plate surface. 

The resistance of each element was measured by a Wheatstone- 
bridge circuit. The temperature rise of the element due to the 
current through the bridge was found to be negligible during the 
brief interval of current flow required while balancing the bridge. 
The bridge balance was measured by a light-beam galvanometer. 
A calibration chart and an individual correction factor for each 
element were furnished by the manufacturer to meet the accuracy 
called for by government specification Mil. B5495. These were 
spot-checked against a temperature standard for randomly chosen 
sample elements and the discrepancy found negligible. Correc- 
tions were of course made for the resistance of the lead wires. 

Ambient-air temperatures were measured by similar resistance 
elements, each suspended inside a vertical cylindrical radiation 
shield of aluminum foil. 

The air-flow patterns in the space between the plates were ex- 
plored to a limited extent by observing the flow of smoke from 
cigarettes attached to long probes inserted from above. 

Transient air-temperature fluctuations in the convection space 
were measured by thermistors of pin-head size with leads of fine 
wire soldered across the tips of two needles. These were mounted 
on long rods used as adjustable probes. The time constant of these 
thermistor elements was a small fraction of a second. The 
transient temperatures were recorded on a photoelectric re- 
corder. 

The major limitation to accuracy of the test results was the ex- 
istence of small temperature fluctuations, as explained later. 
These fluctuations were greater for some configurations than for 
others. 


REDUCTION OF DATA


Location of Ambient-Temperature Measurements. The ambient 
temperature was generally about five deg higher at the level of 
the top edge of the test plates than at the floor. The choice of the 
vertical location for the ambient temperature used in evaluating 
the heat-transfer coefficient of the plate therefore requires
explanation.


In the tests of a single isolated plate, the ambient temperature 
was taken at the same height as the surface-temperature meas- 
urement being considered. 

In tests of parallel plates having their bottom edges raised off 
the floor, the ambient temperature was taken close to the floor, 
since all or most of the ambient air entering the region between the 
plates was sucked in from that level. This floor ambient was used 
even when the space between the heated surfaces was open for 
entry of air at the sides since it was believed that the air flow still 
came mostly from the bottom. 

In tests with the space between the plates closed both at the 
bottom and at the sides, the ambient temperature was taken at the 
level of the top edges of the plates, since the air entering the con- 
vection space came from approximately that location. 

Physical Properties of Air. The physical properties of air used 
to evaluate Nusselt, Grashof, and Prandtl numbers were taken at 
a temperature halfway between the average plate temperature 
and the suitable ambient-air temperature. The values chosen 
were consistent with reference (4). Since the local temperatures 
over the plate in any given test generally varied from the average 
by the order of only 10 deg F or less, no attempt was made to ad- 
just the air properties for this variation. 

Stray Heat Losses. The Nusselt-number results were not cor- 
rected for stray heat losses through the insulation nor for the heat 
losses by radiation through the top or side openings. The Nus- 
selt numbers obtained are therefore slightly higher than values 
representing pure free convection alone. The heat loss through 
the insulation was estimated to be about 5 per cent of the total 
input, and the radiation loss another few per cent, the latter vary- 
ing of course with the distance between the plates. The radiation 
emissivity of the plates was about 0.05 since the material was un- 
coated aluminum. 
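
As a rough illustration of the size of this effect, a reported Nusselt number could be scaled by the fraction of the input that actually leaves the plate by free convection. The sketch below is not part of the original data reduction. It uses the 5 per cent insulation loss quoted above and an arbitrary 3 per cent for the radiation loss, which the text gives only as "another few per cent"; the function name and default values are hypothetical.

```python
def convective_nusselt(nu_reported, f_insulation=0.05, f_radiation=0.03):
    """Scale a reported Nusselt number to its free-convection part only.

    f_insulation : fraction of input lost through the insulation
                   (about 5 per cent according to the text)
    f_radiation  : fraction lost by radiation (ASSUMED 3 per cent;
                   the text says only "another few per cent")
    """
    # Nusselt number is proportional to heat flux at fixed temperature
    # rise, so removing the stray losses scales it by the same fraction.
    return nu_reported * (1.0 - f_insulation - f_radiation)


print(convective_nusselt(100.0))
```

Because the radiation loss varies with plate spacing, a single correction factor of this kind would be only approximate for any one configuration.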

Dimensionless Correlation of Results. The results were first
correlated in dimensionless form by plotting, for each configuration,
the local Nusselt number $hx/k$ as a function of the product
of the local Grashof number, $g\beta(t_w - t_\infty)x^3/\nu^2$, times
Prandtl number, $c_p\mu/k$.
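
The reduction of one thermometer reading to these groups can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function names, the unit system, and the treatment of air as an ideal gas (expansion coefficient equal to the reciprocal of the absolute film temperature) are assumptions, and air properties must be supplied by the user from a source such as reference (4).

```python
# Sketch: local dimensionless groups for one measuring station.
# Names and unit choices are illustrative assumptions.

G = 32.174  # gravitational acceleration, ft/sec^2


def film_temperature(t_plate_avg_f, t_ambient_f):
    """Temperature halfway between average plate and ambient air, deg F."""
    return 0.5 * (t_plate_avg_f + t_ambient_f)


def ideal_gas_beta(t_film_f):
    """Expansion coefficient 1/T for an ideal gas, 1/deg R (assumption)."""
    return 1.0 / (t_film_f + 459.67)


def local_groups(q_flux, t_w, t_inf, x, k, nu, beta, pr):
    """Return (Nusselt, Grashof, Grashof*Prandtl) at height x.

    q_flux     : surface heat flux, Btu/(hr ft^2)
    t_w, t_inf : local surface and ambient temperatures, deg F
    x          : height above bottom edge, ft
    k          : thermal conductivity of air, Btu/(hr ft F)
    nu         : kinematic viscosity of air, ft^2/sec
    beta       : expansion coefficient, 1/deg R
    pr         : Prandtl number
    """
    dt = t_w - t_inf
    h = q_flux / dt                          # local coefficient
    nusselt = h * x / k
    grashof = G * beta * dt * x**3 / nu**2
    return nusselt, grashof, grashof * pr
```

The properties `k`, `nu`, and `pr` would be evaluated at the value returned by `film_temperature`, in keeping with the property convention described earlier.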

One reason for using this correlation was to compare the results 
with previously proposed correlations of this form for a single 
vertical plate. Another reason was that, except for the closely 
spaced configurations discussed later, the results obtained for one 
particular set of gas properties, temperature difference, and plate 
height, could be presumed applicable to any different set yielding 
the same Grashof-Prandtl-number product, without additional 
tests of the individual effects of these parameters. 

The local Grashof number is identical with the product

$$(x/L)^3 \, [g\beta(t_w - t_\infty)L^3/\nu^2]$$

The right-hand factor is the “total” Grashof number, based on
the total height $L$. In the present tests the variation in local
Grashof number is almost entirely due to variation of the factor 
(x/L); the total Grashof number varied only slightly throughout 
the tests. 

The slope of the correlation curves, in logarithmic co-ordinates,
obtained by variation of the factor (x/L) in these tests, had
practically the same value as the slope obtained by previous
investigators for a single vertical plate when varying factors in the
total Grashof number. Hence, in the present tests of parallel plates,
if the total Grashof number were varied by changing the temperature
difference while (x/L) remained fixed, it would be expected
that the Nusselt number would follow almost the same curve as
that obtained by varying the (x/L)-factor. Since the Nusselt
number depends on a small fractional power of Grashof number, a
relatively large change in $(t_w - t_\infty)$ would be required to
determine the magnitude of any small difference in the slope of the
curves for these two cases. It was not practicable to undertake
tests for this purpose with much smaller or larger values of
$(t_w - t_\infty)$; the accuracy would have become poor for very small
values, and a much higher surface temperature would have damaged the
electrical insulation in the heated panels. A few results obtained
with a 20 per cent reduction in $(t_w - t_\infty)$ did not show any
significant deviation from the curve obtained by variation of (x/L).

Grashof-Number Effects for Close Spacings. The displacement 
thickness of a turbulent boundary layer on a single isolated verti- 
cal plate cooled by free convection has been found theoretically 
(6) to be given by the relation 


$$\delta^*/x = 2.04\,(N_{Gr})^{-1/6}(N_{Pr})^{\ldots}$$


In the derivation the free-convection heat-transfer correlation 
proposed by Jakob (7) was employed, which applies to the range 
of Grashof numbers considered here. 

When two heated plates are brought close together, the ratio
s/L of spacing to height at which the boundary layers of the two
plates will touch, or interfere by a definite amount, may therefore
be expected to be proportional to $(N_{Gr})^{-1/6}$ for a given value of
$N_{Pr}$. Since the 1/6 power is such a low power, it would take a
relatively large change in $(N_{Gr})_L$ between two tests to establish
the accuracy limitations of this hypothesis, and no attempt to do
so was made in the present program.

The results reported for the effect of s/L therefore apply only
to the particular values of $(N_{Gr})_L$ representative of the tests;
namely, about $1.8 \times 10^{10}$. In the absence of data to the contrary
it is suggested that heat-transfer correlations for other values of
$(N_{Gr})_L$ in the turbulent range may be predicted by replacing s/L
in the present correlation by the quantity


$$(s/L)\,(1.8 \times 10^{10}/N_{Gr,L})^{1/6}$$


Basis of Comparison for Different Configurations. To show the 
effect of changing a configuration by variation of one of its dimen- 
sions, it was desirable to cross-plot the results in another form. 
For this purpose the ordinate chosen is the ratio of the local heat- 
transfer coefficient h (for any particular configuration con- 
sidered) to the local heat-transfer coefficient h* for a particular 
configuration chosen as the standard of reference. The rather 
arbitrary choice for this reference configuration is the one with 
two heated plates, 53 in. wide, spaced 20 in. apart, with the bottom
and sides closed, and the fans not running; h* was evaluated
at the height x = 0.81L, for which $N_{Gr}N_{Pr}$ was approximately
$10^{10}$.

One set of curves on the cross plots has been taken for a fixed
value of the product of local Grashof and Prandtl numbers equal
to $1.0 \times 10^{10}$. This use of a common, fixed Grashof-Prandtl-number
product makes the (h/h*) ratio closely represent the
relative heat rate q″ for a common fixed temperature rise
$(t_w - t_\infty)$ rather than the relative temperature rise for a fixed
heat rate. A second set of curves has been cross-plotted for a
Grashof-Prandtl-number product of $5 \times 10^{8}$, and a third for
$1.85 \times 10^{7}$. These represent results at levels on the plate of
30 and 10 per cent, respectively, of the total height L, for the same
temperature rise for which the first set of curves represents a level
of 81 per cent. The reference heat-transfer coefficient h*, utilized
in the second and third sets, has been retained at the same value as
in the first set.
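
The three Grashof-Prandtl-number products chosen for the cross plots follow from the cube-law dependence of the local Grashof number on height. The short sketch below checks the arithmetic, assuming a fixed temperature rise and the total product of about 1.8 × 10^10 quoted for these tests; the function name is hypothetical.

```python
TOTAL_GR_PR = 1.8e10  # total Grashof-Prandtl product quoted in the text


def local_gr_pr(x_over_l, total=TOTAL_GR_PR):
    """Local Grashof-Prandtl product at fractional height x/L,
    for a fixed temperature rise: local = (x/L)**3 * total."""
    return x_over_l**3 * total


for frac in (0.81, 0.30, 0.10):
    print(f"x/L = {frac:.2f}: local NGr*NPr = {local_gr_pr(frac):.3g}")
```

The computed values, roughly 9.6 × 10^9, 4.9 × 10^8, and 1.8 × 10^7, are close to the rounded figures used for the three sets of curves.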


RESULTS


Surface Temperatures for Single Plate. The observed variation
of surface temperature vertically along a single, isolated, uniformly
heated plate is shown in Fig. 2. For about the bottom 12 in.
the temperature rises in proportion to about the 0.2 power of
the height but remains nearly constant above the 12-in. level.
This behavior is consistent with theoretical expectations when the
boundary layer is assumed laminar up to the 12-in. level, and
turbulent above that. The theoretical slope of 1/5 is a consequence
of the fact that the case considered here is one of uniform heat flux
rather than the more customary case of uniform temperature
(see reference 5 or the more recent reference 8).
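
The 1/5 slope can be recovered from a power-law correlation by a short argument. With uniform flux, h = q″/(t_w − t_∞), so a correlation of the form hx/k ∝ (N_Gr N_Pr)^n, in which N_Gr ∝ (t_w − t_∞)x³, gives (t_w − t_∞)^(1+n) ∝ x^(1−3n). The sketch below evaluates the resulting exponent. The laminar value n = 1/4 is the classical one; the turbulent value n = 1/3 is an assumption made here for illustration, chosen because it reproduces the nearly constant temperature observed above the 12-in. level.

```python
from fractions import Fraction


def temperature_rise_exponent(n):
    """Exponent m in (t_w - t_inf) ~ x**m for uniform heat flux,
    given a local correlation Nu_x ~ (Gr_x * Pr)**n."""
    n = Fraction(n)
    return (1 - 3 * n) / (1 + n)


print(temperature_rise_exponent(Fraction(1, 4)))  # laminar, n = 1/4
print(temperature_rise_exponent(Fraction(1, 3)))  # turbulent, n = 1/3 (assumed)
```

Exact fractions are used so that the laminar result appears as 1/5 rather than as a rounded decimal.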

Effects of Configuration Changes on Nusselt Numbers. The cor- 
relations of the test results in the form of Nusselt number as a 
function of the local Grashof-Prandtl-number product are pre- 
sented in Figs. 3 to 7. 


Fig. 2  Local Temperature Rise of Surface of a Single 70-In-High
Uniformly Heated Vertical Plate as a Function of Height Above
Bottom Edge
(Abscissa: height, inches.)


Fig. 3 presents both theoretical and experimental results for the 
single isolated plate, and also the experimental results for a pair 
of parallel plates, with the space between them open at both 
sides, at the bottom, and at the top. 

The theoretical correlation shown in Fig. 3 for a single plate 
with a laminar boundary layer and uniform heat flux q″ is that
derived in reference (5)


$$hx/k = \ldots \qquad [1]$$


This is about 9 per cent above the corresponding correlation for 
uniform wall temperature from reference (7). 

The correlation proposed by Jakob (7) for a turbulent layer on 
a vertical plate, shown in Fig. 3, is 


$$hx/k = \ldots \qquad [2]$$


This relation also applies for the case of uniform heat flux, as 
shown in reference (5). 

For a single plate, the test results in Fig. 3 for the turbulent
range are 5 to 10 per cent above Equation [2]. This agreement
is quite satisfactory, the discrepancy being attributed to
stray heat losses. In the laminar range, which extends over
about the lower 15 per cent of the plate, the discrepancy is somewhat
greater and is not fully accounted for. However, this is
probably not too significant when making relative comparisons of
the heat transfer from various configurations which have been
tested.

Fig. 3  Test Results for a Pair of Vertical Heated Plates With Space
Between Them Open at Sides and Bottom; Compared With Test Results
and Previous Correlations for a Single Plate Alone
(Plate height 70 in.; plate width 53 in.; clearance above floor 5 in.;
average $N_{Gr,L}N_{Pr} = 1.8 \times 10^{10}$. Abscissa: local Grashof
number times Prandtl number.)

Fig. 4  Test Results for Same Configurations and Conditions as in
Fig. 3 Except That Space Between Pair of Plates Was Closed at Sides
by Insulation Extending Down to Floor
(Abscissa: local Grashof number times Prandtl number.)

The results in Fig. 3 for parallel heated plates are generally 
slightly below those for the single plate, but are relatively insensi- 
tive to variation of the spacing between plates over the range 
tested. Carpenter and Wassell (2) observed a large decrease in 
heat transfer, but this was at a spacing of only 1 per cent of plate 
height. 

Fig. 4 presents the test results for the same configurations as 
Fig. 3 except that the space between the plates was closed at both 
sides by insulated panels which extended from the top all the way 
to the floor. Thus a vertical rectangular duct was formed open to 
the room at both bottom and top. A comparison of Figs. 4 and 3 
shows that closing the sides has only a very slight effect over the 
upper portion of the plates. The most significant effect is the 
appreciable increase in the heat-transfer coefficient near the 
bottom of the plates at the smaller spacings due presumably to the 
greater “chimney” effect with the sides closed. It is noted that
decreasing the spacing tends to increase the laminar character- 
istics of the flow as evidenced by the slopes of the correlation 
curves. 


Test results for successive decreases in the clearance opening
between the bottom edges of the plates and the floor are presented
in Fig. 5(a) for one of the configurations considered in Fig.
4. The particular horizontal spacing between the plates chosen
for this purpose was 3 in. (0.043 L). The restrictions to the flow
of air into the bottom of the vertical test duct were not thin
sharp-edged plates, but flat ducts 14.5 in. long, since the insulated
space on the back of each heated plate was 14.5 in. thick (see Fig. 1).
The cross plot, Fig. 5(b), shows that for the particular test
conditions, the heat-transfer coefficient was relatively insensitive to
the clearance except for clearance below 1 per cent of the plate
height.

The effect of varying the horizontal spacing s between the
plates (for the widest plates tested, and with the bottom and
both sides closed) is shown in Fig. 6. For smooth plates, without
the fans operating, the heat-transfer coefficient is relatively
insensitive to the spacing for spacings above 20 per cent of the plate
height. Although it decreases at spacings smaller than this, the
decrease is not as severe as had been expected from estimates of
the free-convection boundary-layer thickness on individual plates
and from considerations of the interference of these layers with
the downward flow of fresh, cooler air in the middle portion of the
space. At spacings as small as 10 per cent of the plate height, h
is still about 80 per cent of the value it has for the largest spacing,
whereas the boundary-layer displacement thickness at the top of
each plate has been calculated from Equation [15] in reference (6)
to be 5 per cent of the plate height, if unaffected by the presence
of any adjacent plate. Thus for the 10 per cent spacing the two
theoretical boundary layers would just touch each other and tend
to block seriously the downward flow of cooling air. This would
be expected to decrease h to much less than the 80 per cent value
just mentioned. A reason for the better-than-expected heat transfer
actually observed under these conditions will be considered
later.

The effect of adding the previously described turbulence promoters
is also shown in Fig. 6(b) (cross-plotted from curves of the
same type as Fig. 6(a) but not presented in this paper). It is
evident that the effect of the promoters is relatively small. For
the larger spacings between plates they increase h by a few per
cent on the upper portion of the plates, but for the smaller spacings,
and near the bottom part of the plate for all spacings, they
appreciably decrease h.

Fig. 5(a)  Test Results for Effect of Variation of Clearance of
Bottom Edge of Plates Above Floor, for One Configuration of Fig. 4;
Namely, a Plate Spacing of 3 In.

Fig. 5(b)  Cross Plot From Fig. 5(a) of Relative Heat-Transfer
Coefficient as a Function of Clearance of Bottom Edge of Plates
Above Floor for Plate Spacing of 3 In.
(Ordinate: relative local heat-transfer coefficient, h/h*; abscissa:
height of bottom edges above floor, as fraction of plate height.
h* is value of h from Fig. 6(a) for x/L = 0.81, b/L = …, s/L = 0.28,
zero clearance at bottom, and $N_{Gr}N_{Pr} = 10^{10}$.)


Fig. 6(a)  Test Results for Pair of Heated Vertical Plates With Space
Between Them Closed at Bottom as Well as at Sides; Also Test Results
and Previous Correlations for a Single Plate Alone
(Horizontal “ticks” on left or right sides of test points refer to
temperatures 9 in. from left or right sides, respectively, of plate;
other test points were taken near middle of plate; plate height 70 in.;
plate width 53 in.; average $N_{Gr,L}N_{Pr}$ = 1.… Abscissa: local
Grashof number times Prandtl number.)

Fig. 6(b)  Cross Plot of Effect of Plate Spacing on Relative
Heat-Transfer Coefficients, for Configurations and Conditions
Considered in Fig. 6(a)
(Both plates heated. Results with addition of horizontal strips, or of
a horizontal air stream across the top, are also shown. Ordinate:
relative local heat-transfer coefficient, h/h*; abscissa: plate
spacing (s/L), as fraction of plate height.)

Fig. 7(a)  Test Results for Same Configurations as in Fig. 6, Except
Only One Plate Was Heated Instead of Both
(Abscissa: local Grashof number times Prandtl number.)

Fig. 7(b)  Cross Plot From Fig. 7(a) of Effect of Plate Spacing on
Relative Heat-Transfer Coefficients for Vertical Plates, With Only
One Plate Heated, and With Space Between Plates Closed at Sides and
Bottom
(Ordinate: relative local heat-transfer coefficient, h/h*.)


The effect of providing a horizontal air flow across the tops of 
the plates is also shown in Fig. 6(b). Near the tops of the plates, 
this air flow is seen to improve the heat transfer as would be
expected. The results plotted for x = 0.81L apply only to the
heated plate nearest the source of the horizontal air stream. The 
values of h were considerably greater on the other heated plate at 
this level owing to the more direct impingement of the air stream. 
On the lower portions of the heated plate, the effect of the hori- 
zontal flow is relatively small, and at close spacings actually de- 
creases h, as a result perhaps of its interference with flow patterns 
described later. The effect in this region was found to be the 
same for both plates. 

Fig. 7 shows results for the same configurations as Fig. 6 but 
with only one plate heated and the other insulated. 

Fig. 7(b) is cross-plotted from Fig. 7(a) and from similar curves
which are not shown. A comparison with Fig. 6(b) shows that 
the effect of plate spacing is not greatly different, but that with 
only one plate heated, the heat-transfer coefficient is 10 to 25 
per cent higher than for both plates heated. This is consistent 
qualitatively with the greater space provided for downward flow 
by the absence of an upward flowing boundary layer on the un- 
heated plate. These values of h are also seen to be higher than the 
experimental results for the single plate [compare Figs. 7(a) and 3].

The effect of decreasing the width b of the convection space for
each of two different, fixed, spacings between the pair of heated 
plates is shown in Fig. 8. The results show a significant decrease 
in the heat-transfer coefficient, even for widths considerably 
greater than the spacing between plates. 

Fig. 8  Cross Plot of Effect of Width of Test Duct (Closed at Sides
and Bottom) on Relative Heat-Transfer Coefficients, for Two Different
Plate Spacings, With and Without Horizontal Air Stream Across the
Top; Both Plates Heated; Average $N_{Gr,L}N_{Pr} = 1.8 \times 10^{10}$
(Ordinate: relative local heat-transfer coefficient, h/h*; abscissa:
plate width (b/L), as fraction of plate height. Legend: plate spacing,
per cent of L, and horizontal velocity, ft/min.)

Flow Patterns and Flow Pulsations. Figs. 9 and 10 pertain to
the results of explorations of the flow patterns in the space
between plates, with the bottom and sides closed. Fig. 9 represents
schematically the pattern which might be expected from two- 
dimensional considerations of the boundary-layer formation on a 
single heated vertical flat plate. This was substantially the pat- 
tern observed for a pair of heated plates when the spacing be- 
tween the plates was large; i.e., 28 per cent of the plate height. 


Fig. 9  Two-Dimensional Schematic Indicating Flow Pattern Which
Might Be Expected Between a Pair of Vertical Heated Plates With
Space Between Them Closed at Sides and Bottom

Fig. 10  Three-Dimensional Schematic Diagram Indicating Asymmetrical
Flow Distribution Observed at Top of a Duct Closed at Bottom and
Sides, When Spacing Between Heated Plates Was 21 Per Cent or Less
of Height


When the spacing was reduced to 0.21L, the width being 0.75 
L, the interference of the upward flowing boundary layers with 
the downward flow, required to feed those layers, apparently 
modified the flow patterns. The cross section occupied by the 
downward flow no longer extended over nearly the full width of 
the plate in a symmetric manner, but was confined to about one 
half the width, as indicated by the solid arrows in Fig. 10, with 
the other half width having only upward flow. This asymmetric 
pattern is presumably the reason why the heat-transfer coefficient 
fell off less rapidly, with decreasing spacing, than had been ex- 
pected. 

With spacings of 0.21L or less, and widths of 0.75L, a periodic 
fluctuation in the flow velocity was observed in the region be- 
tween the plates. When the spacing was reduced to 0.07L, the 
fluctuation amplitude was so large that actual reversals in flow 
direction took place periodically as indicated by the dotted 
arrows in Fig. 10. Similar reversals occurred at a spacing of 0.04L.
Records of the temperature of this fluctuating air flow were 
consistent with flow observations made with smoke; the air tem- 
perature near the top, halfway between the heated plates and near 
one side, was close to room temperature at the same time that the 
air temperature at a similar point near the other side was much
warmer. A moment later the temperature at the first point
would rise, while the temperature at the second point would drop
to about room temperature. The average period of the flow
reversals was about 20 sec, but with considerable variation from
cycle to cycle.

When the width of the test duct was reduced to 0.28L, with 
plate spacing at 0.21L, the pattern represented by Fig. 10 was no
longer evident. The temperature of the air halfway between the 
heated plates and near one side showed irregular fluctuations 
representative of turbulent flow, but at intervals of from 5 to 40 
sec abruptly dropped to a value close to room temperature for 1 
to 2 sec duration. 

For a duct with an intermediate width of 0.53L, and a plate 
spacing of 0.21L, sometimes one and sometimes the other of the
foregoing types of flow and temperature fluctuations was ob- 
served. This resulted in different values of heat-transfer co- 
efficients in successive tests, as indicated in Fig. 8. 


CONCLUSIONS 


The following conclusions apply to a vertical duct of rectangu- 
lar cross section with the two wider walls uniformly heated and 
having a Grashof-Prandtl-number product of about $1.8 \times 10^{10}$,
based on total height of the plates: 


1 When the duct was open at the bottom the local heat-trans- 
fer coefficients were affected only slightly by the following: 


(a) The presence or absence of vertical enclosing walls at the 
sides. 

(b) The spacing between the heated walls, down to the mini- 
mum spacing tested (2 per cent of duct height). 

(c) The clearance of the bottom edge above the floor, down to 
a clearance of 1 per cent of the duct height. 


2 When the duct was closed at the bottom and at the sides, 
the following effects were observed: 


(a) The local heat-transfer coefficients were appreciably af- 
fected over most of the ranges tested, by both duct width and the 
spacing between the heated plates. 

(b) The air-flow pattern in the duct became asymmetric, with 
a tendency to pulsate or reverse in direction for certain geometries; 
namely, ducts of large width and relatively close spacing between 
the heated plates. 

(c) Provision of a horizontal air stream across the top of the 
duct can cause a small decrease in the heat-transfer coefficients of 
the lower portions of the plates. It tends to increase the co- 
efficients for the upper portions except when the spacing between 
plates is quite small. 

(d) Provision of horizontal turbulence-promoter strips, on the 
heated plates, results in no significant change in the heat-transfer 
coefficient except at the closest plate spacing tested, where it re- 
sults in a considerable decrease due to blocking the air flow. 

(e) The heat-transfer coefficient for a heated plate is somewhat 
larger when the plate faces an unheated plate rather than a 
heated one. 


BIBLIOGRAPHY 


1 “Heat Dissipation of Parallel Plates by Free Convection,” by
W. Elenbaas, Physica, vol. 9, January, 1942, p. 1.

2 “The Loss of Heat by Natural Convection From Parallel Vertical
Plates in Air,” by L. G. Carpenter and H. C. Wassell, Proceedings of
The Institution of Mechanical Engineers, London, England, vol.
128, November-December, 1934, pp. 439-457.

3 “Studies on Heat Transfer in Laminar Free Convection with the
Zehnder-Mach Interferometer,” by E. R. G. Eckert and E. E. Soehngen,
Air Force Technical Report 5747, December, 1948.

4 “The NBS-NACA Tables of Thermal Properties of Gases,
Tables 2.10, 2.39, and 2.42 (Dry Air),” by H. W. Woolley, F. C.
Morey, and R. L. Nutall, National Bureau of Standards, 1950.

5 “Analysis of Laminar and Turbulent Free Convection From a
Smooth Vertical Plate With Uniform Heat Dissipation per Unit Surface
Area,” by Robert Siegel, General Electric Report 54GL89, April,
1954.

6 “Analytical Investigation of Flow and Heat Transfer in Coolant
Passages of Free-Convection Liquid-Cooled Turbines,” by E. R. G.
Eckert and T. W. Jackson, NACA RM E50D25, July, 1950.

7 “Heat Transfer,” by M. Jakob, John Wiley & Sons, Inc., New
York, N. Y., 1949, p. 530.

8 “Laminar Free Convection From a Vertical Plate With Uniform
Surface Heat Flux,” by E. M. Sparrow and J. L. Gregg, Trans.
ASME, vol. 78, 1956, pp. 435-440.


Discussion 


FRANK KREITH.⁴ The data presented by the authors represent
a valuable contribution to the understanding of free convection in 
partially enclosed spaces. This type of information is not only 
important to the development of free-convection cooling systems 
for turbine blades and nuclear power plants, but it is also of 
fundamental interest. 

In the following discussion some of the results presented in the 
paper are re-examined and compared with published information 
on related systems and unpublished data obtained at Lehigh 
University on a geometrically similar system. It is hoped that 
these remarks will foster a continued interest in the fundamental 
aspects of the problem. 

An inspection of Fig. 3 reveals that the experimental data for 
the lower portion of a single vertical plate, where the flow is be- 
lieved to be entirely laminar, fall 20 to 30 per cent above the line 
representing a theoretical equation predicted by Siegel, and deviate
even further from theory when compared with an analysis
by Sparrow and Gregg (9)⁵ which gives a constant of 0.408 compared
to the value of 0.453 in Equation [1]. Similar experimental
results were obtained at Lehigh University with an electrically 
heated copper plate, 3 ft tall and 1 ft wide, in a range of Grashof 
numbers in which over */; of the plate is covered by a turbulent 
free-convection boundary layer. In view of the deviation of these 
experimental results from theoretical predictions for purely 
laminar flow, one wonders whether the turbulent free-convection 
boundary layer could have an influence on the laminar boundary 
layer below it and whether equations for purely laminar flow apply 
to free convection in systems where both laminar and turbulent 
boundary layers are present. The reliability of the authors’ ex- 
perimental data is strengthened further by comparing some of 
their results with experimental data obtained by Elenbaas (1) on 
a geometrically similar system. A rearrangement of the data 
presented in Fig. 3 in terms of the dimensionless numbers used by 
Elenbaas, $hs/k$ and $g\beta\,\Delta T\,s^4c_p\mu/(Lk\nu^2)$, shows that the data for free
convection between parallel plates with open bottoms agree quite 
well with those of reference (1). 
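
The Elenbaas grouping is simply a Grashof-Prandtl-number product based on the plate spacing s, multiplied by the ratio s/L. A minimal sketch of the calculation is given below; the function and argument names are hypothetical, and any consistent set of units may be used.

```python
def elenbaas_parameter(g, beta, delta_t, s, length, nu, pr):
    """Return N_Gr,s * N_Pr * (s/L), the abscissa used by Elenbaas.

    g       : gravitational acceleration
    beta    : volumetric expansion coefficient
    delta_t : plate-to-ambient temperature difference
    s       : spacing between the plates
    length  : plate height L
    nu      : kinematic viscosity
    pr      : Prandtl number, c_p * mu / k
    """
    grashof_s = g * beta * delta_t * s**3 / nu**2  # based on spacing s
    return grashof_s * pr * (s / length)
```

Because the spacing enters to the fourth power, halving s at fixed height and temperature difference reduces the parameter sixteenfold, which is why closely spaced plates fall so far to the left on a plot of this kind.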

⁴ Associate Professor of Mechanical Engineering, Lehigh University,
Bethlehem, Pa. Assoc. Mem. ASME.

⁵ Numbers from 9 to 12 in parentheses refer to the Bibliography at
the end of this discussion.

Fig. 11  Correlation of Experimental Data Shown in Fig. 6(a)
(Legend: Elenbaas (ref. 1), open bottom; test points at 61, 36, and
12 inches from bottom; data obtained at Lehigh University with plates
of 3 ft height, one spacing showing unsteady flow. Abscissa involves
(s/L).)

The dimensionless number $g\beta\,\Delta T\,s^4c_p\mu/(Lk\nu^2)$ or $N_{Gr,s}N_{Pr}(s/L)$
can also be used to advantage in correlating data of other
configurations. Using the best estimates possible from the small-sized
graphs in the preprint, the data shown in Fig. 6(a) are replotted
in Fig. 11, where also some experimental points obtained at Lehigh
University on a similar system are shown. The system used
in the tests at Lehigh University was constructed by Mr. A. W.
Stubner, a former graduate student in the department of mechanical
engineering, as a part of his master’s program. It consists
of two parallel vertical copper plates 3 ft high and 1 ft wide,
each of which can be heated electrically by passing current
through heating wires imbedded between sheets of mica on the back
sides of the plates. The surfaces of the plates on which the heating
wires are installed were covered with asbestos insulation. The
heat loss through the insulation was measured as a function of the 
plate temperature by placing the plates face to face, heating them, 
and measuring their temperature and the heat dissipation at 
equilibrium. The plate temperatures were measured at various 
locations with iron-constantan thermocouples peened into the 
copper plates from the rear. The results of two tests with the 
plates 2 in. and ''/,, in. apart, sides and bottom closed, are shown 
in Fig. 11. They are in good agreement with the data from Fig. 
6(a), and suggest that a Grashof number based on the distance 
between the plates may be a significant indication of the turbu- 
lence for this system, somewhat analogous to a Reynolds number 
based on the hydraulic diameter or radius in forced convection. 
The use of the plate spacing as the significant length dimension 
appears to be particularly useful (1, 10) when that distance is so 
small that the entire space between the plates is filled with fluid 
influenced in some way by viscous forces. 

It is of interest to note that in those systems where the bottom 
was open (see Figs. 3 and 4) the curves representing the Nusselt 
numbers for the smaller openings cross the curves representing 
the Nusselt numbers for the larger openings. This trend is not 
observed in any of the systems in which the bottom was closed. 
This discusser would like to ask if an explanation for these phe- 
nomena can be found from an examination of the flow in a man- 
ner similar to that employed by Lighthill (10), or Batchelor (11) 
in their respective analyses of free convection problems. 

The description of the periodic flow fluctuations which were ob- 
served when the distance between the plates was reduced to less 
than 20 per cent of their height is possibly the most interesting 
phase of the experimental work. Lighthill (10), in his study of 
free convection in a tube closed at the bottom, showed theoreti- 
cally how the flow pattern changes when the boundary layer be- 
comes so thick at the top that the inflow of colder fluid is choked. 
His results, however, do not apply here because in a tube, symmetry
of the flow is maintained even after choking, whereas in a
narrow space between flat plates the flow pattern ceases to be
symmetrical about the center plane (see Fig. 9) and becomes circulatory.
Although the appearance of this type of motion is
physically not unexpected since it allows for continued flow and
effective heat transfer after the central inflow pattern becomes
choked, a reason for its unsteadiness cannot be found readily.
However, similar flow instabilities have been observed by Sparrow
and Kaufman (12) in a narrow vertical space fed by fluid from a
heated reservoir at the bottom and cooled at the top, and their
observations suggest an explanation for the flow reversal observed
by the authors. It may be noted also that, in the system
treated in the present paper, the natural tendency of heated fluid is
always to rise. Hence, in a narrow channel fed by fluid from the
top, the inflowing fluid also is subject to a buoyant force which, in
the case of circulatory motion, is weaker than the
downward forces produced by upward moving fluid. But, as
shown in Fig. 12, there must be somewhere in the space between
the plates some fluid which is moving neither upward nor downward
because it is located at an intermediate boundary where the
velocity is zero. This boundary will shift as the stationary fluid
rises under the influence of buoyancy and thus reduce the area
available for downflow. The shift continues until the boundary
has moved so far that the inflow becomes choked. The choking
action could conceivably produce a flow reversal as shown by the
sketches in Fig. 12.

Fig. 12 Sketches of Flow Reversal

This discusser would like to suggest that the authors present in 
their closure typical samples of data showing the temperature 
variations in the fluid at the open end as well as the variations in 
surface temperature (that is of the heat-transfer coefficient) during 
unsteady flow. This information would be of help in further 
studies of flow phenomena associated with free convection in 
partially confined spaces. 


BIBLIOGRAPHY 
9 “Laminar Free Convection From a Vertical Plate With Uniform
Surface Heat Flux,” by E. M. Sparrow and J. L. Gregg, Trans.
ASME, vol. 78, 1956, pp. 435-440.

10 “Theoretical Considerations in Free Convection in Tubes,” by
M. J. Lighthill, Quarterly Journal of Mechanics and Applied Mathematics,
vol. VI, 1953, pp. 398-439.

11 “Heat Transfer by Free Convection Across a Closed Cavity
Between Vertical Boundaries at Different Temperatures,” by G. K.
Batchelor, Quarterly Journal of Applied Mathematics, vol. XII, No. 3,
1954, pp. 209-233.

12 “Visual Study of Free Convection in a Narrow Vertical Enclosure,”
by E. M. Sparrow and S. J. Kaufman, NACA RM No. E
55L 14a, Feb. 16, 1956.


LLOYD TREFETHEN.⁶ It would be helpful if the authors could
provide more information about the asymmetrical flow pattern of 
Fig. 10, and an explanation of the periodic reversal of that flow 
which they observed. The high density of the fluid moving down 
compared to that of the hotter rising fluid, together with the 
equality of pressure of fluid as it enters and leaves, would seem to 
tend toward stability of the asymmetrical flow once it has estab- 
lished itself in either direction. Yet the authors report the inter- 
esting fact that such a flow is not always stable. 

The somewhat similar flow patterns obtained by Timo (13)⁷
were stable. There, liquid metal in a narrow annulus between 
vertical adiabatic cylinders was frozen at the top, and contacted 
a pool of hot liquid metal at the bottom of the cylinders. Per- 
haps the rising hot streams scalloped the metallic ice in such a way 
as to provide a stabilizing geometry. On the other hand, Sparrow 
and Kaufman (14) cooled the top of water between vertical 
adiabatic glass plates which were open at the bottom to a heated 
reservoir. They observed violently unstable and irregular con- 
vective patterns. 

This large-scale departure from two-dimensional flow conditions 
should perhaps be anticipated in some other instances of convective 
flow between vertical boundaries. For example, thermopane 
windows are frequently subjected to uneven temperatures, inside 
by heating currents, and outside by winds. These uneven effects 
can be seen on single-thickness glass windows in the winter, when 


6 Assistant Professor, Division of Engineering and Applied Physics, 
Harvard University, Cambridge, Mass. Assoc. Mem. ASME. 

7 Numbers from 13 to 15 in parentheses refer to Bibliography at 
the end of this discussion. 


TRANSACTIONS OF THE ASME 


the boundaries between the drop-condensation, film-condensa- 
tion, freezing, and sublimation regions show large dips and peaks. 
Nonhorizontal isotherms on thermopane would tend to cause 
large-scale eddies of the air between the panes similar to those 
observed by Timo and by the authors. If such eddies did occur, 
the thermal resistance of the window might differ from the re- 
sistance predicted from a two-dimensional analysis, such as 
Batchelor’s (15). 


BIBLIOGRAPHY 


13 “Free Convection in Narrow Vertical Sodium Annuli,” by D.
P. Timo, U. S. Atomic Energy Commission Technical Information
Service, Oak Ridge, Tenn., Report KAPL-1082, March 5, 1954.

14 “Visual Study of Free Convection in a Narrow Vertical Enclosure,”
by E. M. Sparrow and S. J. Kaufman, National Advisory
Committee for Aeronautics Research Memorandum E 55L 14a, February
16, 1956.

15 “Heat Transfer by Free Convection Across a Closed Cavity Between
Vertical Boundaries at Different Temperatures,” by G. K.
Batchelor, Quarterly Journal of Applied Mathematics, vol. XII, No. 3,
1954, pp. 209-233.


AUTHORS’ CLOSURE


The authors would like to thank the discussers for their interest 
in and comments on the subject of free convection in confined 
spaces. 

The data supplied by Professor Kreith for heat transfer be- 
tween two heated plates are very welcome as they provide inde- 
pendent information which tends to confirm the authors’ find- 
ings for this geometry. It was interesting that his experiments 
using a single plate also yielded values in the laminar region which 
were higher than the theoretical laminar correlations. Contrary 
to the statement in the discussion, the analysis of Sparrow and 
Gregg (reference 8 of the paper) is in good agreement for Pr = 
0.7 with that of Siegel (5). For, if we use Fig. 4 of reference (8)
and also the notation of that reference and note the definitions
of $Gr_x^*$ and of $Nu_x$, we have $Nu_x/(Gr_x^*)^{1/5} = Nu_x/(Gr_x Nu_x)^{1/5} = 0.5$.
Then, for $Pr = 0.70$, $Nu_x = (0.5)^{5/4}\,(Gr_x Pr)^{1/4}/(Pr)^{1/4} = 0.458\,(Gr_x Pr)^{1/4}$.
This factor 0.458 agrees substantially with
0.453 in Equation [1] of the paper, which came from reference
(5).
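The arithmetic in the preceding paragraph can be checked with a few lines of Python. This is only a sketch: it assumes the relation quoted in the text, $Nu_x/(Gr_x Nu_x)^{1/5} = 0.5$, and the function name is hypothetical. Solving for $Nu_x$ gives a coefficient of roughly 0.46, which is close to the quoted 0.458 and within about 2 per cent of 0.453.

```python
# Sketch of the coefficient check; not the authors' own calculation.
# Assumption: Nu_x / (Gr_x * Nu_x)**(1/5) = 0.5, as quoted in the closure.
PR = 0.70      # Prandtl number used in the closure
RATIO = 0.5    # value the closure reads from Fig. 4 of reference (8)


def local_nusselt_coefficient(ratio: float, pr: float) -> float:
    """Return C in Nu_x = C * (Gr_x * Pr)**(1/4).

    From Nu_x**(4/5) = ratio * Gr_x**(1/5) it follows that
    Nu_x = ratio**(5/4) * Gr_x**(1/4) = ratio**(5/4) / pr**(1/4) * (Gr_x*pr)**(1/4).
    """
    return ratio ** 1.25 / pr ** 0.25


coefficient = local_nusselt_coefficient(RATIO, PR)
print(f"C = {coefficient:.4f}")
```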

Professor Kreith comments on the crossing of the Nusselt- 
number curves in Figs. 3 and 4. An analysis of this system 
might require a consideration of the laminar to turbulent transi- 
tion for free convection in a channel, which is not well under- 
stood at present. Ignoring this transition effect, however, the 
following explanation of the crossing of the curves seems not un- 
reasonable. As the plates are brought closer together, the re- 
sulting increase in air temperature between the upper portions of 
the plates creates a better draft (chimney effect) which increases 
the upward velocity between the lower plates. This improves 
the heat transfer there, since the boundary layers are then thin 
enough so as to leave room for unheated ambient air between 
the two layers. This effect is greater when the sides are closed 
(Fig. 4) than when they are open (Fig. 3) as already mentioned 
in the paper. But between the upper portions of the plates the 
thermal boundary layers have merged, so closer spacing means 
higher air temperature at the center, which reduces the heat 
transfer. 

With reference to Professor Trefethen’s comment on the ob- 
served flow reversals, the authors are unable to supply additional 
information to clarify the origin of this flow configuration. Pro- 
fessor Kreith offers a possible explanation, and it would be de- 
sirable to have additional experimental information on the de- 
tails of the flow during the oscillations to establish his hypothesis. 

To better illustrate the nature of the oscillations, Figs. 13 and
14 herewith show typical recordings of air-temperature variations
in the space between two heated plates with the sides and
bottom closed. Recordings of this nature are discussed toward
the end of the paper in the section on flow pulsations. No attempt
was made to record the fluctuations in plate temperature
since the output of the resistance thermometers attached to the
heated surfaces would not be representative of the temperature
fluctuations of the portions of the plates where these thermometers
were not present, owing to differences in time constant of
temperature response to changes in the convective heat-transfer
coefficient.

Fig. 13 Air Temperatures Halfway Between Heated Plates and Level With Top
(Solid curve for point 0.06L from one side; dotted curve 0.03L from other
side. b/L = 0.75; s/L = 0.07.)

Fig. 14 Air Temperatures Halfway Between Heated Plates and Level With Top
(Solid curve for point 0.03L from one side; dotted curve 0.03L from other
side. b/L = 0.53; s/L = 0.07.)


A Mechanical Computing Device for the
Analysis of One-Dimensional, Transient, 
Heat-Conduction Problems 


By W. E. HOWLAND,¹ E. A. TRABANT,² AND G. A. HAWKINS,³ LAFAYETTE, IND.


In 1953 one of the authors learned that Dr. P. Hacke- 
mann, the year before, had designed and constructed a 
mechanical device for use in the solution of one-dimensional,
transient, heat-conduction problems (1, 2).⁴ The
device replaces the laborious task of line drawing required 
in the Binder-Schmidt graphical method by a semi- 
automatic machine operation. This paper covers an ex- 
tension of the Hackemann idea to the solution when the 
thermophysical properties vary with temperature. Be- 
fore introducing the suggested mechanical computer, a 
brief discussion of the Binder-Schmidt graphical method 
for solving one-dimensional, transient, heat-flow prob- 
lems is given. 


ILLUSTRATIVE USE OF BINDER-SCHMIDT METHOD FOR A SIMPLE
HEAT-TRANSFER PROBLEM


THE Binder-Schmidt procedure is basically a graphical solution
of a finite-difference equation which replaces the basic
partial differential equation used to express the transient-conduction
heat transfer in the system.
For the purpose of illustration consider the partial differential
equation for unidimensional flow of heat in a bar of uniform cross
section and length $l$

$$\frac{\partial t}{\partial \tau} = \alpha\,\frac{\partial^2 t}{\partial x^2} \qquad [1]$$

where

$t$ = temperature in rod at any position $x$
$\tau$ = time
$\alpha$ = thermal diffusivity of rod material, or $\alpha = k/\rho c$
$k$ = thermal conductivity of material
$c$ = specific heat
$\rho$ = density

For initial condition it is assumed that

$$t = \phi(x) \qquad [2]$$

where $\phi(x)$ is an arbitrary function, and the boundary conditions
are taken as

$$t = t_a \ \text{at}\ x = 0, \qquad t = t_k \ \text{at}\ x = l \qquad [3]$$

1 Professor of Sanitary Engineering, Purdue University.

2 Associate Professor of Engineering Sciences, Purdue University.

3 Dean of Engineering, Purdue University. Mem. ASME.

4 Numbers in parentheses refer to the Bibliography at the end of the
paper.

Contributed by the Heat Transfer Division and presented at the
Semi-Annual Meeting, Cleveland, Ohio, June 17-21, 1956, of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
31, 1956. Paper No. 56-SA-27.



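To solve Equation [1] numerically it is usual to replace the
derivatives with respect to time or distance by finite differences.
If $t_n(x)$ and $t_{n+1}(x)$ are the temperature distributions at the
beginning and end of the nth time interval $(\tau_n, \tau_{n+1})$ of length
$\Delta\tau$, then

$$\frac{t_{n+1} - t_n}{\Delta\tau} = \frac{k}{\rho c}\,\frac{d^2 t_n}{dx^2} \qquad [4]$$

With the following definition

$$\beta^2 = \frac{k\,\Delta\tau}{\rho c} = \alpha\,\Delta\tau \qquad [5]$$

Equation [4] may be written

$$\beta^2\,\frac{d^2 t_n}{dx^2} = t_{n+1} - t_n \qquad [6]$$

If $t_{m,n}$ and $t_{m+1,n}$ represent the temperature at the beginning
and end of the mth distance interval $(x_m, x_{m+1})$ of length $\Delta x$ and
at the beginning of the nth time interval, then Equation [6] may
be written as

$$t_{m,n+1} - t_{m,n} = \frac{\beta^2}{(\Delta x)^2}\left(t_{m+1,n} - 2t_{m,n} + t_{m-1,n}\right) \qquad [7]$$

Equation [7] is a formula of recursion which indicates that the
temperature at a position $x = m\,\Delta x$ may be calculated for the
instant $\tau = (n + 1)\Delta\tau$ if the distribution in the neighborhood of $x$ is
known at $\tau = n\,\Delta\tau$.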

A graphical solution of Equation [7] may be accomplished in 
the following manner: In Fig. 1 the curve a-c-e-g-i-k represents 
the arbitrary initial condition, Equation [2], and the points a and 
k correspond to the boundary conditions, Equation [3]. The bar 
has been divided into 10 increments Az, as shown. 


If a straight line is drawn from e to g, it is evident that

$$ff' = \tfrac{1}{2}\,(t_{6,0} + t_{4,0}) - t_{5,0} = \tfrac{1}{2}\,(t_{6,0} - 2t_{5,0} + t_{4,0}) \qquad [8]$$

Comparison of Equations [7] and [8] shows that if the quantity
$ff'$ is multiplied by the factor $2(\beta/\Delta x)^2$ the result will be the
increase of temperature at the position $x = 5\,\Delta x$ in a time interval
$\Delta\tau$. If the points b, d, h, and j are found in a manner similar to f,
then these points represent the temperature at the time $\tau = \Delta\tau$.


Fig. 1


A continuation of this construction will produce lines of tempera- 
tures in equal steps of Ar. 

It is observed that, if the graphical procedure is carried far
enough, the end result will be a straight line connecting the points
a and k. This agrees with the analytic steady-state solution.

Moreover, since it is always possible to choose values of $\Delta x$ and
$\Delta\tau$ such that

$$\frac{\beta^2}{(\Delta x)^2} = \frac{1}{2} \quad\text{or}\quad \Delta\tau = \frac{(\Delta x)^2}{2\alpha} \qquad [9]$$

Equation [7] may be written in the simplified form

$$t_{m,n+1} = \tfrac{1}{2}\left(t_{m+1,n} + t_{m-1,n}\right) \qquad [10]$$

HACKEMANN IDEA 


In this example it is easy to see that an elastic string could be 
made to take the successive positions of the temperature-distribu- 
tion curve if it were operated in the following manner: First, let 
it be stretched to take the positions a-c-e-g-i-k. Then let it be held 
at the positions a-b-d-f-h-j-k and released at points c, e, g, i. Then 
let it be clamped in these new positions where the string crosses 
the vertical lines above the points c, e, g, i, and so on. The string
will always be held at a and k but alternately on the solid vertical 
lines. A mechanism that alternately would clamp the string 
along every other vertical line and then on the others is not dif- 
ficult to imagine. Such a device has been constructed by Dr. 
Hackemann. 
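
The alternate clamping described above can be imitated numerically. The sketch below is an illustration only, not a description of Dr. Hackemann's apparatus: it assumes equally spaced vertical lines and an ideal string that pulls straight between the clamps on either side of a released point, so that a released point moves to the mean of its two neighbors. The function names and starting values are hypothetical.

```python
# Sketch of the alternately clamped elastic string (idealized, equal spacing).
# Names and starting values are illustrative assumptions.

def release(temps, parity):
    """Release the interior clamps whose index has the given parity.

    The string between the two neighboring (still clamped) points pulls
    straight, so each released point moves to the mean of its neighbors.
    The end points (index 0 and the last index) are never released.
    """
    out = list(temps)
    for m in range(1, len(temps) - 1):
        if m % 2 == parity:
            out[m] = 0.5 * (temps[m - 1] + temps[m + 1])
    return out


def run_string(temps, steps):
    """Alternate between releasing odd-numbered and even-numbered clamps."""
    for n in range(1, steps + 1):
        temps = release(temps, n % 2)
    return temps


start = [100.0, 0.0, 70.0, 0.0, 40.0, 0.0, 30.0, 0.0, 20.0, 0.0, 0.0]
after_one = release(start, 1)      # odd points fall on the chords a-c, c-e, ...
settled = run_string(start, 500)   # approaches the straight line a-k
print([round(v, 3) for v in settled])
```

Each release corresponds to one time interval of the graphical construction, which is why only every other point is updated at a time.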


FLOW OF HEAT IN COMPOSITE WALLS


The possible use of an elastic-string computer in the study of 
temperature distributions in composite walls provides an interest- 
ing illustrative example. 

Fig. 2 Actual Walls

Fig. 3 Substitute Walls

There are two principal difficulties in the way of applying the
elastic-string method to this problem: (a) When the string is
stretched across the interface, it naturally will be a straight, unbroken
line as the support at the interface is removed. This
straightness of line would imply that the temperature gradient is
a constant across the interface, which it certainly is not when the 
k-values of the two materials are different. (b) When the k-values 
of the two materials are different, the calculated values of Ar 
would be different in the two sections as shown by Equation [9] 
providing Az is the same throughout. Under these conditions, 
the string would have to move more rapidly through part of its 
length than in the other. This would require two strings and is 
impractical. The remedy for the second difficulty is obviously to 
change the value of Az in the different parts of the path to make 
Ar constant everywhere and this can be done readily. The remedy 
for the first difficulty is to replace each section of wall by a dif- 
ferent one of different thickness, but of the same conductivity as 
every other and of resistance to heat-flow equivalent to that of 
the wall replaced and of equivalent total heat capacity. This also 
may be done as will now be shown. 

Replace the actual wall shown in Fig. 2 by the nearly equivalent
wall shown in Fig. 3. The difference is that the material designated
by subscript 2 has been replaced by another one whose conductivity
value $k_3$ is equal to $k_1$, and whose thickness has been
changed so that the resistance to the steady flow of heat is the same
as that of the material replaced. This means that the thickness of
the new material is

$$l_3 = \frac{k_3}{k_2}\,l_2 = \frac{k_1}{k_2}\,l_2 \qquad [11]$$

The new wall also must have the same heat capacity as the wall
replaced. This means that the product $c_3\rho_3$ of the material of the
new wall must be changed in the ratio of the lengths or the conductivities.
Therefore

$$c_3\rho_3 = c_2\rho_2\,\frac{l_2}{l_3} = c_2\rho_2\,\frac{k_2}{k_3} \qquad [12]$$

For example, if the length has been shortened as shown, i.e., if
$l_2 > l_3$, then more heat capacity must be crowded into the same
space so that $c_3\rho_3 > c_2\rho_2$.

The two walls shown in Fig. 3 are now exactly equivalent to 
those shown in Fig. 2. Since the conductivity is the same 
throughout, a constant difference in temperature between the 
outsides of the combined wall of Fig. 3 will produce a straight 
unbroken temperature line for steady flow. Furthermore, for 
unsteady flow the slope of the temperature gradient at the inter- 
face will be continuous. 

It now appears that the elastic-string analogy to the flow of 
heat through walls of different material will present no serious 
difficulties at the interface. Across the interface, when not held 
fixed at the interface, the elastic string will be pulled straight, im- 
plying that the temperature gradient is continuous at the inter- 
face—which is true when the conductivities are arbitrarily ad- 
justed to be the same in both wall materials, by Equations [11] 
and [12]. 
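
The substitution of an equivalent wall can be illustrated numerically. The Python sketch below assumes the two conditions stated in the text (equal resistance to steady heat flow, $l/k$, and equal total heat capacity, $c\rho l$, with the substitute conductivity set equal to that of the first wall). The function name and the numerical values are hypothetical and carry no particular units.

```python
# Sketch of the substitute-wall calculation; names and values are illustrative.

def substitute_wall(l2, k2, c2_rho2, k1):
    """Return (l3, k3, c3_rho3) for the wall that replaces material 2.

    k3 is set equal to k1; l3 keeps the thermal resistance l/k unchanged;
    c3_rho3 keeps the total heat capacity c*rho*l unchanged.
    """
    k3 = k1
    l3 = l2 * k3 / k2
    c3_rho3 = c2_rho2 * l2 / l3
    return l3, k3, c3_rho3


# Hypothetical wall 2: thickness 0.3, conductivity 2.0, c*rho = 1.5e6;
# hypothetical wall 1 conductivity 0.5.
l2, k2, c2_rho2, k1 = 0.3, 2.0, 1.5e6, 0.5
l3, k3, c3_rho3 = substitute_wall(l2, k2, c2_rho2, k1)

alpha2 = k2 / c2_rho2          # diffusivity of the actual material
alpha3 = k3 / c3_rho3          # diffusivity of the substitute material
print(l3, c3_rho3, alpha3 / alpha2)
```

With these numbers the substitute wall is thinner than the original, its $c\rho$ is correspondingly larger, and the diffusivity ratio equals $(k_1/k_2)^2$.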

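It is recalled that the time intervals $\Delta\tau$ must be the same for
all materials since one string must be used for the entire gradient.
This is accomplished by keeping $\Delta\tau$ the same throughout. When
$\alpha_3$ is different from $\alpha_1$, then $\Delta x_3$ must differ from $\Delta x_1$ in accordance
with Equation [9]. In the illustrative example presented, since

$$k_3 = k_1 \quad\text{and}\quad c_3\rho_3 l_3 = c_2\rho_2 l_2$$

then

$$\frac{\alpha_2}{\alpha_3} = \frac{l_2^2}{l_3^2} = \frac{k_2^2}{k_3^2} = \frac{k_2^2}{k_1^2}$$

and

$$\frac{\Delta x_3}{\Delta x_1} = \frac{(2\alpha_3\,\Delta\tau)^{1/2}}{(2\alpha_1\,\Delta\tau)^{1/2}} = \sqrt{\frac{\alpha_3}{\alpha_1}} \qquad [13]$$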


With composite sections an additional difficulty may often be
encountered. After an appropriate choice of $\Delta x_1$ has been made
and $\Delta x_3$ has been calculated, it may be found to be not an aliquot
part of the wall thickness $l_3$. In other words, the total equivalent
thickness $l_3$ of the second part of the wall may not be an integral
multiple of the calculated value of $\Delta x_3$. Suppose that $l_3$ divided
by $\Delta x_3$ comes out to be $n$, an integral number, plus a fraction $f$.
Choose $n$ as the number of sections and divide the length $l_3$ into
this number of sections. Place clamping bars of width

$$(l_3 - n\,\Delta x_3)/n = f\,\Delta x_3/n \qquad [14]$$

at the boundaries of these sections each $l_3/n$ wide. The center
line of a clamping bar should coincide with the boundaries of the
sections. At the interface and end place clamping bars of one
half this width. If a drafting-board procedure is to be used, allow
a gap of these magnitudes to exist between the ends of the chords
as shown in Fig. 4. The chords will have a horizontal projected
width of exactly $\Delta x_3$ as calculated.
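
The bookkeeping for the clamping bars can be sketched in Python. The example assumes that the increment in the substitute wall follows from the increment in the first wall through the square root of the diffusivity ratio, and that the bar width is the leftover thickness shared equally among the $n$ sections. The function name and the numbers are hypothetical.

```python
# Sketch of the clamping-bar layout; names and numbers are illustrative.
import math


def bar_layout(l3, dx1, alpha1, alpha3):
    """Return (dx3, n, f, bar_width) for a substitute wall of thickness l3."""
    dx3 = dx1 * math.sqrt(alpha3 / alpha1)   # increment required in wall 3
    n = int(l3 // dx3)                       # whole number of sections
    if n < 1:
        raise ValueError("wall is thinner than one increment")
    f = l3 / dx3 - n                         # leftover fraction of an increment
    bar_width = (l3 - n * dx3) / n           # width of each clamping bar
    return dx3, n, f, bar_width


# Hypothetical case: l3 = 0.075, dx1 = 0.02, diffusivity ratio alpha3/alpha1 = 0.25.
dx3, n, f, bar_width = bar_layout(0.075, 0.02, 4.0e-7, 1.0e-7)
section_width = 0.075 / n
print(dx3, n, round(f, 3), bar_width, section_width - bar_width)
```

In this case each section is wider than the calculated increment by exactly one bar width, so the free chord between bars spans the calculated increment.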

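If the cross-sectional areas of the two sections are different as
well as the corresponding thermal constants, then the second section
is replaced by a third whose resistance to heat flow and whose
total heat capacity are the same as those of the section replaced
but, in addition, whose product of thermal conductivity and area
is the same as that of the first section. This is

$$A_1k_1 = A_3k_3 \qquad [15]$$

But

$$\frac{A_2k_2}{l_2} = \frac{A_3k_3}{l_3}$$

or

$$\frac{k_3}{k_2} = \frac{A_2l_3}{A_3l_2} \qquad [16]$$

and

$$A_3l_3c_3\rho_3 = A_2l_2c_2\rho_2 \quad\text{or}\quad \frac{A_2l_2}{A_3l_3} = \frac{c_3\rho_3}{c_2\rho_2} \qquad [17]$$

Now $\alpha = k/\rho c$, so that

$$\frac{\alpha_3}{\alpha_2} = \frac{k_3}{k_2}\,\frac{c_2\rho_2}{c_3\rho_3} = \left(\frac{A_2l_3}{A_3l_2}\right)\left(\frac{A_3l_3}{A_2l_2}\right) = \frac{l_3^2}{l_2^2} \qquad [18]$$
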
DISCUSSION OF METHOD FOR FINDING EQUATIONS FOR SPACING
OF LINES


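For radial heat conduction in cylinders, the differential equation
is

$$\frac{\partial t}{\partial\tau} = \alpha\left(\frac{\partial^2 t}{\partial r^2} + \frac{1}{r}\,\frac{\partial t}{\partial r}\right) \qquad [19]$$

By the change of variable

$$x = \ln r \qquad [20]$$

Equation [19] may be written in the form

$$\frac{\partial t}{\partial\tau} = \frac{\alpha}{r^2}\,\frac{\partial^2 t}{\partial x^2} \qquad [21]$$

Fig. 4

For radial heat flow in spherical shells the differential equation
is

$$\frac{\partial t}{\partial\tau} = \alpha\left(\frac{\partial^2 t}{\partial r^2} + \frac{2}{r}\,\frac{\partial t}{\partial r}\right) \qquad [22]$$

By the change of variable

$$x = \frac{1}{r} \qquad [23]$$

Equation [22] may be written

$$\frac{\partial t}{\partial\tau} = \frac{\alpha}{r^4}\,\frac{\partial^2 t}{\partial x^2} \qquad [24]$$

Thus it is seen that both Equations [19] and [22] may be put
into a form similar to Equation [1]; namely

$$\frac{\partial t}{\partial\tau} = \Phi\,\frac{\partial^2 t}{\partial x^2} \qquad [25]$$

where $\Phi$ is not a constant as in Equation [1] but a function of the
variable $x$.

Now as in Equation [9] for the case of the rod, so here for the
cylinder let

$$\frac{\Phi\,\Delta\tau}{(\Delta x)^2} = \frac{1}{2}$$

Since for the cylinder $\Phi = \alpha/r^2$, then

$$\frac{\alpha\,\Delta\tau}{r^2(\Delta x)^2} = \frac{1}{2}$$

or

$$\Delta\tau = \frac{r^2(\Delta x)^2}{2\alpha} = \frac{(\Delta x)^2}{2\Phi} \qquad [26]$$

Similarly as in the case of the rod and cylinder, for the sphere
let

$$\Delta\tau = \frac{r^4(\Delta x)^2}{2\alpha} = \frac{(\Delta x)^2}{2\Phi} \qquad [27]$$

since for the sphere $\Phi = \alpha/r^4$.

For these two cases the Binder-Schmidt method simply requires
that the differential equation be put in the form of Equation
[25], where $\Phi$ is not necessarily a constant but may be some
function of $x$, and that $\Delta x$, the space between the parallel
lines, be chosen in accordance with the equation

$$\Delta x = \sqrt{2\Phi\,\Delta\tau} \qquad [28]$$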

The question naturally arises: “Could not $\Phi$ be other functions
of $x$ besides those here dealt with and also could it not be a function
also of $t$?” The latter case would result in curved lines rather
than in parallel straight lines. If $\Phi$ were allowed to vary with
other functions of $x$ and also with $t$, the generality of the method
would be increased to include practical problems in which the
thermal properties themselves are affected by both the variation
of $x$ and of $t$, if not, indeed, of time itself.
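
The reduction of the cylindrical and spherical equations to the rod-like form can be spot-checked numerically. The sketch below is an illustration under stated assumptions: it takes $x = \ln r$ for the cylinder and $x = 1/r$ for the sphere as the changes of variable (choices consistent with $\Phi = \alpha/r^2$ and $\Phi = \alpha/r^4$), sets $\alpha = 1$, and uses an arbitrary smooth test profile $t = r^3$. All function names are hypothetical.

```python
# Numerical spot check of the changes of variable (alpha taken as 1).
# Assumptions: x = ln r for the cylinder, x = 1/r for the sphere.
import math


def first_derivative(f, x, h=1e-4):
    """Central-difference estimate of f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def second_derivative(f, x, h=1e-4):
    """Central-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


def t_of_r(r):
    """Arbitrary smooth test profile."""
    return r ** 3


def cylinder_sides(r):
    """Radial operator in r versus (1/r^2) d2t/dx2 with x = ln r."""
    in_r = second_derivative(t_of_r, r) + first_derivative(t_of_r, r) / r
    in_x = second_derivative(lambda x: t_of_r(math.exp(x)), math.log(r)) / r ** 2
    return in_r, in_x


def sphere_sides(r):
    """Radial operator in r versus (1/r^4) d2t/dx2 with x = 1/r."""
    in_r = second_derivative(t_of_r, r) + 2.0 * first_derivative(t_of_r, r) / r
    in_x = second_derivative(lambda x: t_of_r(1.0 / x), 1.0 / r) / r ** 4
    return in_r, in_x


print(cylinder_sides(2.0), sphere_sides(2.0))
```

For $t = r^3$ the exact values are $9r$ for the cylinder and $12r$ for the sphere, so both pairs should agree closely at any radius tried.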


GENERAL METHOD OF CHOOSING SPACING OF LINES OR BARS ON
COMPUTING DEVICE


In the expression

$$\Delta x = \sqrt{2\Phi\,\Delta\tau}$$

assume that $\Delta\tau$ is a constant and the appropriate value of

$$\Delta x/\sqrt{\Phi(x, t)}$$

can be found by

$$\sqrt{2\,\Delta\tau} = \int_{x_a}^{x_b}\frac{dx}{\sqrt{\Phi(x, t)}} \qquad [29]$$

$x_a$ and $x_b$ are the corresponding $x$-values on two successive lines
$\Delta x$ apart horizontally at a particular value of $t$. Now the value of

$$z = \int_{x_a}^{x}\frac{dx}{\sqrt{\Phi(x, t)}} \qquad [30]$$

one way or another, can be evaluated for particular values of $t$.
Here $x$ is a general value of $x$ and a value of $t$ will be chosen which
will give the maximum value of spacing $\Delta x$. The curve of $z$ versus
$x$ can be plotted for all values of $x$ along the path of heat flow, say
$z$ vertically and $x$ horizontally. Then the ordinate of this curve
can be divided into sections of equal length

$$\Delta z = \sqrt{2\,\Delta\tau} \qquad [31]$$

The values of $x$ corresponding to the extremities of these $\Delta z$-sections
will be the points on the successive lines or bars the correct
$\Delta x$ distance apart. Then the values of $\Delta x$ will be computed at the
middle of the spaces so determined, but at various values of $t$,
from the equation

$$\Delta x = \sqrt{2\Phi(x, t)\,\Delta\tau}$$


The extremities of these spaces $\Delta x_a$, $\Delta x_b$, $\Delta x_c$, etc., would then
be plotted at various values of $t$, the temperature. It is noted that
the change in $\Phi$ alters the incremental slab thickness. However,
compensation is not made for variation in over-all thickness owing
to a temperature-dependent conductivity. The graph or computing
device might then appear as shown in Fig. 5. The shaded
portion of the sketch, if it is to be considered as a Hackemann
mechanism, may be thought of as movable bars or clamps: α, β,
γ, etc., as shown. The initial temperature distribution is indicated
by the initial position of the elastic string, which is represented
by the lowest inclined and broken dotted line shown in
Fig. 5. It is held in position by all the bars α, β, γ, and so on,
which press the elastic string down firmly on a horizontal board.
The ends of the elastic string are always held firmly at points p and
q. Then the alternate bars β, δ, and ζ are raised and the elastic
string stretches to the position shown by the next higher inclined
broken line (shown as a continuous line). Then the bars β, δ, ζ
are lowered to press against the string and the bars γ and ε are
raised and the string takes the position shown by the next higher
inclined broken line (shown as a dotted line). The successive positions
of the string indicate successive temperature distributions
at successive equal intervals of time. Thus $t_0$, $t_1$, $t_2$, etc., are the
temperatures at section A at successive equal increments of time,
$\Delta\tau$.

Fig. 5 [Sketch of the computing device. Recoverable labels: spaces at temperature $t_1$; temperature = $t_0$; maximum spaces; section A.]
All the lines and curves could, of course, have been constructed 
with a pencil on a drafting board, most of them with a straight 
edge, without the aid of an elastic string or clamping mechanism, 
but such a device, if perfected, might prove useful in expediting 
the procedure, especially when a very large number of sections 
$\Delta x$ and time intervals $\Delta\tau$ are to be employed.

It is of interest to apply the method to Equation [1] and the 
examples given in reference (2). 

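For Equation [1], $\Phi(x, t) = \alpha$ = const, and using Equations
[30] and [31]

$$\int_{x_a}^{x_b}\frac{dx}{\sqrt{\alpha}} = \frac{\Delta x}{\sqrt{\alpha}}$$

$$\Delta z = \sqrt{2\,\Delta\tau}$$

$$\Delta x = \sqrt{2\alpha\,\Delta\tau}$$

which confirms the result, Equation [9].
For Equation [21]

$$\Phi(x, t) = \frac{\alpha}{r^2} = \frac{\alpha}{e^{2x}}$$

and Equations [30] and [31] give

$$\int_{x_a}^{x_b}\frac{dx}{\sqrt{\alpha/e^{2x}}} = \frac{e^{x_b} - e^{x_a}}{\sqrt{\alpha}} = \frac{\Delta r}{\sqrt{\alpha}}$$

$$\Delta r = \sqrt{2\alpha\,\Delta\tau}$$

which confirms the previous analysis as shown in Equation [26].
In order to obtain the required equal increments of $r$, plot $e^x$ and
choose appropriate increments of $x$ therefrom.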

Equation [24] may be handled in a similar manner. 
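
The general spacing rule can be tried out numerically. The Python sketch below is an illustration, not the authors' procedure: it accumulates $z = \int dx/\sqrt{\Phi}$ by the trapezoidal rule and marks a line each time $z$ has advanced by $\sqrt{2\,\Delta\tau}$. The function name, the grid size, and the numerical values are hypothetical, and $\Phi$ is taken here as a function of $x$ only.

```python
# Sketch of the line-spacing rule: equal steps of z = integral dx/sqrt(Phi).
# Names, grid size, and numbers are illustrative assumptions.
import math


def line_positions(phi, x_start, x_end, d_tau, n_grid=20000):
    """Return the x-positions at which z has advanced by sqrt(2*d_tau)."""
    dz = math.sqrt(2.0 * d_tau)
    h = (x_end - x_start) / n_grid
    positions = [x_start]
    z, next_z = 0.0, dz
    x_prev = x_start
    f_prev = 1.0 / math.sqrt(phi(x_prev))
    for i in range(1, n_grid + 1):
        x = x_start + i * h
        f = 1.0 / math.sqrt(phi(x))
        z_new = z + 0.5 * (f_prev + f) * h        # trapezoidal increment of z
        while next_z <= z_new:                    # a line falls inside this cell
            frac = (next_z - z) / (z_new - z)     # linear interpolation in the cell
            positions.append(x_prev + frac * h)
            next_z += dz
        z, x_prev, f_prev = z_new, x, f
    return positions


ALPHA, D_TAU = 0.5, 0.01                          # so sqrt(2*ALPHA*D_TAU) = 0.1

# Rod: Phi = alpha, so the lines should be equally spaced in x.
rod = line_positions(lambda x: ALPHA, 0.0, 1.0, D_TAU)

# Cylinder with x = ln r: Phi = alpha/e^(2x), so the lines should be
# equally spaced in r = e^x between r = 1 and r = 2.
cyl = line_positions(lambda x: ALPHA * math.exp(-2.0 * x), 0.0, math.log(2.0), D_TAU)
radii = [math.exp(x) for x in cyl]
print(rod[:4], radii[:4])
```

Both checks reproduce the spacing $\sqrt{2\alpha\,\Delta\tau}$, in $x$ for the rod and in $r$ for the cylinder.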

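The heat-flow equation may be written in the more general
form

$$\frac{\partial t}{\partial\tau} = \frac{\alpha}{\Psi(r, t)}\,\frac{\partial}{\partial r}\left[\Psi(r, t)\,\frac{\partial t}{\partial r}\right] \qquad [32]$$

By the transformation of variable defined by

$$dx = \frac{dr}{\Psi(r, t)} \qquad [33]$$

Equation [32] may be changed in the following manner: Now

$$\frac{\partial t}{\partial r} = \frac{\partial t}{\partial x}\,\frac{\partial x}{\partial r} = \frac{1}{\Psi(r, t)}\,\frac{\partial t}{\partial x}$$

so that

$$\frac{\partial}{\partial r}\left[\Psi(r, t)\,\frac{\partial t}{\partial r}\right] = \frac{\partial}{\partial r}\left[\frac{\partial t}{\partial x}\right] = \frac{1}{\Psi(r, t)}\,\frac{\partial^2 t}{\partial x^2}$$

Hence in terms of the new variable $x$ Equation [32] takes the
form

$$\frac{\partial t}{\partial\tau} = \frac{\alpha}{[\Psi(r, t)]^2}\,\frac{\partial^2 t}{\partial x^2} \qquad [34]$$

This shows that the general Equation [32] may be put in the
form of Equation [25] and may be solved by the described computing
device or the corresponding graphical procedure. In this
case

$$z = \int_{x_a}^{x}\frac{\Psi(r, t)\,dx}{\sqrt{\alpha}} = \int_{r_a}^{r}\frac{dr}{\sqrt{\alpha}} \qquad [35]$$

If $\Psi$ is a constant then

$$\Delta x = \frac{\Delta r}{\Psi} \qquad [36]$$

and equal increments of $x$ correspond with equal increments of $r$.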


BIBLIOGRAPHY 


1 Based on a discussion with Dr. P. Hackemann in London,
England, in 1953.

2 “A Mechanical Apparatus for the Approximate Solution of
One-Dimensional Heat Conduction Problems,” by P. Hackemann,
paper presented at Symposium at the R.A.E., Farnborough, July,
1954, Ministry of Supply, Heat Transfer Panel, England.

3 “Differenzenverfahren zur Lösung von Differentialgleichungen,”
by Ernst Schmidt, Forschung VDI, Bd/Heft 5, September-October,
1942, p. 177.

4 “The Solution of Transient Heat Conduction Problems by Finite
Differences,” by G. A. Hawkins and J. T. Agnew, Engineering
Bulletin, Purdue University, Research Series No. 98.

5 “Heat Transfer,” by M. Jacob, John Wiley & Sons, Inc., New
York, N. Y., vol. 1, 1949.


Discussion 


MERL BAKER.⁵ The use of the elastic-string analogy as the basis
of a mechanical computing device for the analysis of one-dimen- 
sional transient heat-conduction problems is an ingenious one. 
The diversity of the Binder-Schmidt method has long been 
recognized, and when restricted by proper boundary conditions 
may be useful in solving almost any one-dimensional transient 
heat-flow problem. The disadvantage has been the labor re- 
quired to achieve the desired results by graphical construction. 
With a suitable computing device such as proposed in this paper, 
the method of solution should have far reaching significance. 

The writer has had a particular interest in developing suitable 
boundary conditions permitting application of the Binder- 
Schmidt method to evaluate the heat flow through massive build- 
ing structures which are intermittently heated and cooled. Con- 
siderable progress has been made, with the chief objection being 
the manpower required to complete the construction. This is particularly
objectionable when composite walls are considered.
The computing device used to analyze this particular problem
would be invaluable in determining the heating and cooling loads
under transient conditions.

5 The Kentucky Research Foundation, Lexington, Ky.


G. M. DUSINBERRE.⁶ The bibliography given by the authors
might be supplemented by the following:

“Applied Mathematics in Chemical Engineering,” by T. K.
Sherwood and C. E. Reed, McGraw-Hill Book Company, Inc.,
New York, N. Y., 1939.

“Application of Schmidt’s Graphical Method to Cases Involving
Diffusivity as a Variable,” by A. H. Brown, U. S. Department
of Agriculture Report ED-6-12-12, June 14, 1946.

“Numerical Analysis of Heat Flow,” by G. M. Dusinberre,
McGraw-Hill Book Company, Inc., New York, N. Y., 1949.

“A New Electrical Analog Method for the Solution of Transient
Heat-Conduction Problems,” by G. Liebmann, Trans. ASME,
vol. 78, 1956, pp. 655-665.

It is of interest that Sherwood and Reed suggested that 
variable diffusivity might be handled by a simple transformation 
of co-ordinates, while Brown showed that this could not be done. 
Now the authors have solved the problem by a double set of 
transformed co-ordinates. 

But it is the fate of many problem-solving techniques to remain 
merely academic curiosities. A conspicuous example is the 
“bubble analogy” which, nevertheless, someone is always trying 
to revive as a practical method for heat-source problems. 

If the present contribution—regarded as an engineering tech- 
nique—is to escape this fate, then the authors should show the 
field in which it is useful and, within that field, the economic ad- 
vantage over competitive methods. 

Obviously the present method is limited to one-dimensional 
systems. Practically, it is limited to a small number of nodes or 
reference points. Further, in the writer’s experience, any graphi- 
cal method is intolerably messy for cyclic temperature changes. 
The problem of a variable surface coefficient occurs as often as 
not; the authors do not show how to deal with this. 

The field of application is thus considerably restricted. Even 
within this narrow field the writer feels that the method would 
hardly be competitive with Liebmann’s analog—a tool of much 
greater general utility—or with numerical methods of solution. 

If this estimate is unduly pessimistic, the authors are urged to 
refute it by examples drawn from engineering practice. 


M. J. GOGLIA.7 The authors have presented an extension of


6 Professor of Mechanical Engineering, Department of Mechanical
Engineering, Pennsylvania State University, University Park, Pa. 
Fellow ASME. 

7 Regents’ Professor of Mechanical Engineering, Georgia Institute 
of Technology, Atlanta, Ga. Mem. ASME. 


TRANSACTIONS OF THE ASME 


Hackemann’s elastic-string analog computer to the one-dimen- 
sional heat-conduction problem. There is no need for justifying 
the effort being spent in the development of such devices—they 
often serve as the vehicle by means of which solutions are arrived 
at to problems not tractable by analytic means alone. 

In this regard, two questions naturally present themselves; 
the one is related to the one-dimensional treatment and the 
second to the extension of the method discussed to two-dimen- 
sional considerations. Have the authors explored (a) the manner 
of introducing contact resistances as conditions one would be 
faced with if application of this device were made, for example, to 
laminated transformer cores, and (b) the procedures to be em- 
ployed in the use of this device if the boundary conditions are 
other than the constant temperatures used for the illustrative 
examples in the paper? Have the authors considered the feasi- 
bility of using a two-dimensional network of elastic strings in the 
extension of their analog computer to two-dimensional heat- 
conduction problems? 


VICTOR PASCHKIS.8 Starting from what is introduced as the
“Hackemann mechanism’’ the authors present a number of in- 
teresting extensions of the Binder-Schmidt graphical method for 
solution of transient heat-conduction problems. The main com- 
plaint against this method has always been the length required to 
carry it out. If the Hackemann mechanism can shorten the time 
required in a really significant manner, the method could be 
much more useful. Did the authors actually work with the de- 
vice; and do they have any estimate regarding the time saved? 
How critical is the mechanical execution of the device? For example,
the movable bars or clamps (Fig. 5) (a) will have a finite
width, while they ideally should have zero width; (b) because of
play in the joints, they may not always land on the same line, but
only within a band. What magnitude of error will be introduced by
such unavoidable mechanical limitations?


AUTHORS’ CLOSURE


The computing device described has not been constructed. 
However, the original Hackemann mechanical computer was 
constructed by him and proved successful in its operation. All 
operations of the envisioned computer in the paper have been 
carried out on a drafting board. For the problems considered, 
the error was never more than five per cent, when compared 
with existing analytic solutions. The problem of contact 
resistance and the two-dimensional extension of the computer 
had not been considered at the time of presentation of the paper, 
but will be given consideration. It is thought that mechanical 
devices are possible which will simulate other than constant- 
temperature boundary conditions. 


8 Technical Director, Heat & Mass Flow Analyzer Laboratory,
Columbia University, New York, N. Y. Mem. ASME. 




A Comparison of Refrigerants When Used in
Vapor Compression Cycles Over an
Extended Temperature Range


An important current engineering problem involves 
providing apparatus to meet the cooling requirements of 
supersonic aircraft. Because the time is already passing 
when simple air cycles can render adequate performance 
due to their inability to extend over a sufficient tem- 
perature range, vapor compression cycles are being con- 
sidered. However, the maximum temperature range over 
which a single refrigerant can operate successfully in a 
vapor cycle is also relatively limited. Since the total re- 
quired temperature range increases rapidly with the air- 
craft flight Mach number, series combination or “stack- 
ing” of vapor cycles leading to binary and higher order 
cycles may become necessary. This paper gives methods 
for selecting the refrigerant, or series combination of re- 
frigerants, which exhibits optimum thermodynamic per- 
formance when employed in vapor cycles operating be- 
tween arbitrarily selected temperatures. 


INTRODUCTION 


THE nature of the problem of cooling supersonic aircraft is
now becoming clear. Heat will be transferred from sources
inside the aircraft at various prescribed temperatures to a
sink outside the aircraft at a higher temperature. Fig. 1 presents 
the air stagnation temperature versus flight Mach number rela- 
tionship for standard NACA air. This stagnation temperature 
is the temperature of the sink to which the heat must be rejected 
from the aircraft. Even if ram air-expansion turbines, not popu- 
lar with aircraft designers because of their large size and weight, 
can be employed, the sink temperatures at Mach numbers greater
than 3 are still much higher than those which the refrigeration 
engineer normally encounters. 

The heat-source temperatures depend on the function of the 
aircraft. In general, the entire aircraft will not be cooled; parts 
of it will be allowed to approach stagnation temperature. The 
temperature of three main areas of the aircraft will be reduced: 


1 Certain parts of future hydrocarbon-burning engines will re- 
quire cooling to keep metal temperatures below 2000 R. 

2 The temperature of electronic equipment of present design 
will be kept below approximately 700 R. 

3 Cabins of manned aircraft will be kept at temperatures 
around 510-530 R. 


1 Instructor, Department of Mechanical Engineering, Massa- 
chusetts Institute of Technology. Assoc. Mem. ASME. 

2 Associate Professor, Department of Mechanical Engineering,
Massachusetts Institute of Technology. Mem. ASME. 

3 Department of Mechanical Engineering, Massachusetts Institute 
of Technology. 

Contributed by the Heat Transfer Division and presented at a joint
session with the Aviation Division at the Semi-Annual Meeting,
Cleveland, Ohio, June 17-21, 1956, of THE AMERICAN SOCIETY OF
MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those of
the Society. Manuscript received at ASME Headquarters, January
19, 1956. Paper No. 56-SA-6.


By J. P. BARGER,1 W. M. ROHSENOW,2 AND K. M. TREADWELL,3 CAMBRIDGE, MASS.


Fig. 1  Stagnation and Re-Expanded Air Temperature Versus Mach Number (ordinate: temperature, R; abscissa: flight Mach number, M; curves: stagnation condition; ram air fully expanded, turbine efficiency = 70%; cabin)


At any given Mach number, then, the temperature range be- 
tween source and sink can be determined by comparing one of the 
foregoing source temperatures with the stagnation temperature 
corresponding to the Mach number read from Fig. 1. 
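
As a rough illustration of how this range grows with Mach number, the sketch below evaluates the ideal-gas stagnation relation T0 = T(1 + (γ - 1)M²/2) and subtracts the three source temperatures listed above. The ambient temperature of 390 R and γ = 1.4 are assumptions made here for illustration; they are not read from Fig. 1, and variable specific heat is ignored. The cabin value of 520 R is the midpoint of the stated 510-530 R range, and all names are hypothetical.

```python
GAMMA = 1.4  # assumed constant ratio of specific heats for air

# Source temperatures from the list above, deg R (cabin taken at mid-range).
SOURCE_LIMITS_R = {
    "engine parts": 2000.0,
    "electronic equipment": 700.0,
    "cabin": 520.0,
}


def stagnation_temperature(mach, t_ambient_r=390.0):
    """Ideal-gas stagnation temperature in deg R; the ambient value is an assumption."""
    return t_ambient_r * (1.0 + 0.5 * (GAMMA - 1.0) * mach ** 2)


def source_to_sink_range(mach, t_ambient_r=390.0):
    """Sink temperature minus each source temperature; positive means heat must be pumped."""
    sink = stagnation_temperature(mach, t_ambient_r)
    return {name: sink - limit for name, limit in SOURCE_LIMITS_R.items()}


ranges_at_mach_3 = source_to_sink_range(3.0)
```

A negative entry indicates that the source is hotter than the sink at that Mach number, so no refrigeration cycle is needed for it.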

The following sections evaluate and compare the performance 
of vapor cycles using commercially available refrigerants over 
the required temperature ranges. 


Vapor Cycles. The classical refrigeration cycle of Fig. 2 incorporates
the standard condenser, constant-enthalpy expansion valve, 
evaporator, and compressor in sequence. Friction-pressure drops 
have been neglected. Heat is received in the evaporator from an 
air stream or another condensing refrigerant and is delivered from 
the condenser to another air stream, or to another evaporating 
refrigerant. 

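A standard definition of cycle c.o.p. is used, which is as follows

$$\text{c.o.p.} = \frac{Q_{\text{evaporator}}}{W_{\text{in, compressor}}} = \frac{(\Delta h)_{\text{evaporator}}}{(\Delta h)_{\text{compressor}}} \qquad [1]$$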

Carnot cycles with evaporator and condenser temperatures 
equal to those of the actual cycles show the thermodynamic limit 
of performance for the actual cycles. The solid process line of 


Fig. 2  Actual Vapor Cycle (temperature, R, versus entropy, showing the refrigerant saturation curve and the heat-sink and heat-source temperatures)

Fig. 3  Carnot Refrigeration Cycle (temperature versus entropy, showing the saturation curve)


Fig. 3 represents such a Carnot cycle. For this cycle the com- 
pression process becomes reversible and adiabatic, eliminating the 
actual compressor’s irreversibility; the refrigerant expands at con- 
stant entropy rather than constant enthalpy; and the irreversi- 
bility associated with the desuperheating process is eliminated. 
The Carnot refrigeration c.o.p. reduces to 


Refrigerant            Freezing pt.,         Critical temp.,   Critical press.,
                       deg F, 14.7 psia      deg F             psia
3002. 51400. 
32.0 705.4 3206. 
—9.4 541. 661. 
Ethyl ether..........+. 552.1 380.8 
Trichloroethylene...... webs 520.0 728.0 
Dichloroethylene..... . 470.0 795.0 
Methylene chloride... .. — 142. 421.0 670.0 
Methyl formate........ — 147.5 418.0 607 .0 
—31. 417.4 495.0 
— 168.0 388.4 635.0 
Ethyl chloride......... 369.0 764.0 
Ethylamine........... —115. 362.0 815.0 
—221. 353.3 750.0 
Sulphur dioxide....... —98. 314.8 1141.5 
Methylamine.......... —134. 314.0 1082.0 
— 137. 294.3 474.0 
Methyl chloride. ...... —153. 289.4 969.0 
Ammonia............. —107.9 271.2 1651.0 
ee — 247. 232.7 582.1 
a — 256. 204.8 716.0 
—301. 196.5 667.2 
Nitrous oxide.......... —152. 96.5 1050.0 
Carbon dioxide........ —109. 87.8 1071.0 

— 296. .8 589.0 
Freon 14 -312. -49.9 542.4
— 297. —115.8 673.0 
— 221.0 547.0 


TABLE 1 REFRIGERANT PROPERTIES 


$$\text{c.o.p.}_{\text{Carnot}} = \frac{T_1}{T_2 - T_1} \qquad [2]$$


where $T_1$ is the temperature at which heat is received in the
evaporator, and $T_2$ is the temperature at which heat is rejected
in the condenser. 
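
A minimal sketch of this limit follows, using the Carnot refrigeration expression T1/(T2 - T1) with both temperatures on an absolute scale. The function name and the sample temperatures are illustrative and do not come from the paper.

```python
def carnot_cop(t_evap_r, t_cond_r):
    """Carnot refrigeration c.o.p. for absolute evaporator and condenser temperatures."""
    if t_cond_r <= t_evap_r:
        raise ValueError("condenser temperature must exceed evaporator temperature")
    return t_evap_r / (t_cond_r - t_evap_r)


# Hypothetical cycle: evaporator at 500 R, condenser at 600 R.
limit = carnot_cop(500.0, 600.0)
```

Widening the spread at a fixed evaporator temperature lowers the limit in inverse proportion to the spread, which is why the temperature range matters so much in what follows.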

Refrigerant Limitations. The same Carnot-cycle limitation on 
the coefficient of performance between two widely separated but 
fixed temperatures applies whether one or many cycles in series 
are employed to transfer heat between the temperatures. Favor- 
ing the use of the smallest number of cycles, each with the widest 
temperature spread, is the increased mechanical complexity of 
binary and higher-order arrangements. On the other hand, the 
c.o.p. of each single cycle decreases faster than its Carnot c.o.p. as 
the temperature spread between its evaporator and condenser 
increases because the proportion of irreversible to reversible 
processes in the cycle rises. In determining the minimum possible
number of cycles which can be used between widely separated 
temperatures, the maximum temperature spread for application 
of each refrigerant must be evaluated. 

The maximum practical temperature spread through which an 
actual refrigerant may be used is a function not only of tempera- 
ture considerations such as the freezing point but also of pressure 
and specific-volume limitations. These restrictions divide into 
two groups, those which form the upper limit of condensing tem-
perature and those which form the lower limit of evaporating
temperature. 

As regards the upper limit, the critical temperature, while not 
an absolute barrier to raising the upper temperature of vapor 
compression cycles, certainly represents a practical limit. In Fig. 
4 a dashed constant-enthalpy line representing a throttling ex- 
pansion from the critical point shows how little enthalpy change 
remains to be accomplished in the evaporator. Such an expansion 
always results in a vanishingly small c.o.p. except for extremely
small evaporator-condenser temperature differences. Super- 
critical condensing processes are also less efficient. Using a con- 
stant-temperature heat sink, it is possible to produce subcooled 
condenser exit states by raising the condensing pressure. How- 
ever, no advantage results from this measure because: 


4 Table 1 lists properties of commercially available refrigerants
pertinent to the following discussion on limitations on temperature 
spread. 


Minimum table value                       Maximum table value
Temp. sat. liquid,   Press. sat. liquid,  Temp. sat. vapor,   Press. sat. vapor,
deg F                psia                 deg F               psia
220.0 0.007 1390.0 1100 

32.0 0.09 5.4 3206.2 

Unstable at high temperatures 

10.0 1.38 140.0 26.79 

0.0 1.50 140.0 38.41 
—30.0 0.30 200.0 54.66 
—40.0 0.74 388.4 635.0 
—58.0 0.35 113.0 41.49 
—40.0 1.36 160.0 100.6 
—40.0 3.14 140.0 158.6 
— 58.0 1.32 113.0 98 76 
—30.0 3.4 180.0 160.0 
—80.0 0.46 140.0 84.8 
—80.0 1.95 170.0 283.9 
—20.0 7.50 180.0 210.0 
— 107.8 0.88 125.0 307.8 
—155.0 0.12 232.7 582.1 
—40.0 10. 140.0 263.5 
—155.0 0.20 160.0 448.0 
—75.0 6.37 140.0 305.0 
—50.0 16. 90.0 195. 
—127.0 14.7 96.5 1050.0 
—148.0 7.62 90.1 708 .3 
— 147.0 2.14 87.8 1071.0 
0.43 83.8 579.0 
—176.8 6.75 48.8 731.8 
— 250. 1.1 —49.9 542.4 
— 260.0 15.0 —115.8 673.0 


Fig. 4  Diagram for Pure Substances (absolute temperature versus entropy, showing the liquid + gaseous phase region and a throttling expansion from the critical point)


(a) Evaporator heat transfer increases only minutely as a result 
of the small enthalpy decrease of the prethrottling state. 

(b) With a higher condensing pressure, a higher pressure ratio 
across the vapor compressor leads to considerably increased com- 
pressor work. 


Between identical source and identical sink temperatures, the 
combination of items (a) and (b) always results in a lower c.o.p. 
than accompanies vapor cycles where the preexpansion condition 
is on the saturated-liquid line, therefore precluding consideration 
of any condensing process ending in the subcooled liquid state, in- 
cluding supercritical condensing processes. 

In considering the effect of pressure on the maximum condens- 
ing temperature, the designer of practical systems will be forced 
to choose a pressure limit where the size and weight of the system 
necessary to contain the high pressure begins to control. The 
range of critical pressures for all refrigerants investigated extends 
from 380.8 psia for ethyl ether to 51,400 psia for mercury. 

As regards the lower limit of evaporating temperature, the only 
temperature restriction obviates operational use below the freez- 
ing point for the fluid in question. However, pressure and specific- 
volume limitations become important as evaporator temperature 
decreases. Vapor cycles have not generally been operated below 
atmospheric pressure because of the difficulty in keeping air (a 
contaminant which adversely affects cycle performance) from 
leaking into the system. Toward the lower limit of evaporator 
temperature, the specific volume of the saturated vapor increases 
rapidly with decreasing temperature. For mercury, as an ex- 
ample, the value of this property increases from 8.4 to 5210 cu ft 
per lb as the evaporating temperature decreases from 1060 to 
680 R. By necessitating enormous components, large fluid specific 
volumes possess especially little utility for aircraft refrigeration 
cycles. 

Selection of Optimum Refrigerant in a Single Vapor Cycle. A 
plot of c.o.p. versus evaporator temperature with condenser tem- 
perature as a parameter may be prepared for each refrigerant 
used in actual vapor cycles. Fig. 5 plots the c.o.p. for Freon 11 
with a compressor efficiency of 0.60.5 The c.o.p. may be read 
from the figure for any combination of evaporator and condenser 
temperatures. In comparing c.o.p. when selecting optimum re- 
frigerants for use over a particular temperature range, it is in- 
convenient first to prepare many graphs such as Fig. 5, one for 
each refrigerant, and then to read values of c.o.p. from them. 
A figure showing the relative c.o.p. for all refrigerants together 
gives a visual picture at a glance revealing which are thermody- 
namically best in each temperature range. In Fig. 6 actual c.o.p. 
for thirteen different refrigerants as well as the corresponding 
Carnot c.o.p. have been plotted, each for cycles with condenser


5 0.60 was selected as a typical value for small rotary compressors.


Fig. 5  Freon 11 Vapor Cycle Performance (ordinate: coefficient of performance, c.o.p.; abscissa: evaporator temperature, R; condenser temperature as parameter)


Fig. 6  C.O.P. for Various Refrigerants Versus Evaporator Temperature (ordinate: coefficient of performance, c.o.p.; abscissa: evaporator temperature, R; constant T_cond - T_evap; compressor efficiency = 0.6)

Index of refrigerants in Fig. 6: 1 Mercury; 2 Water; 3 Freon 11; 4 Methyl chloride; 5 Sulfur dioxide; 6 Carrene 7; 7 Freon 12; 8 Freon 22; 9 Ethane; 10 Freon; 11 Ethylene; 12 Freon; 13 Methane


temperatures 100 F higher than evaporator temperatures, and 
compressor efficiencies of 0.60. Inspection of Fig. 6 reveals some
interesting effects: 


1 Refrigerants employed in the same temperature range vary 
in performance for equal component efficiencies, allowing selection 




of a thermodynamically optimum refrigerant for that range. 

2 The highest values of the actual c.o.p. are consistently 
slightly greater than half of the values of Carnot c.o.p. throughout 
the entire temperature range considered. This would be ex- 
pected with a compressor efficiency of 0.60 if the irreversibility of 
the expansion process causes little loss. 

3 The working temperature range of each refrigerant appears 
to decrease with decreasing absolute temperature. 

4 The c.o.p. of a 100 F cycle (of all cycles, in fact, independent
of their evaporator-condenser temperature difference) exhibits a 
definite optimum point, falling off rapidly as the condenser tem- 
perature increases toward the refrigerant critical temperature, 
more gradually as the evaporator temperature decreases below 
the optimum point. 

5 All c.o.p. tend to blend together on the same line as the dif- 
ference between the refrigerant critical temperature and the cycle 
evaporator temperature increases for each refrigerant. However, 
in every case, this region for each refrigerant is accompanied by 
extremely small pressures and large vapor specific volumes, pre- 
cluding use in this range. 


The selection of a thermodynamically optimum refrigerant in a 
particular temperature range can be accomplished as follows: 


(a) The range where refrigerant absolute pressure exceeds at- 
mospheric pressure and refrigerant specific volume is lowest 
corresponds to the “hook’’ or the maximum c.o.p. part of each 
curve shown in Fig. 6. 

(b) Selecting the appropriate temperature range on Fig. 6, ob- 
serve which refrigerants exhibit the highest c.o.p. in this range. 
Select the refrigerant which exhibits the generally highest curve. 

(c) Where two or more refrigerants appear extremely close in 
performance, calculations for each refrigerant in the particular 
cycle desired will reveal which exhibits better performance. 


Additional Factors in Selecting Refrigerants. So far, only ther- 
modynamic performance has been considered in selecting optimum 


TABLE 2 CHECK LIST OF ADDITIONAL FACTORS FOR SELEC- 
TION OF REFRIGERANTS 


Density of liquid and vapor 

Viscosity of liquid and vapor 

Thermal conductivity

Oil solubility 

Chemical stability 

Toxicity and odor 

Reaction with water and oil 

Reaction with materials used in cycle components and in the structure of 
the aircraft 

Leak detection 

Methods of handling 

Other safety considerations 

Cost 


refrigerants. However, the designer of practical systems will 
want to consider many other factors such as those listed in Table 2.


SERIES COMBINATION OF VAPOR CYCLES


Selection of Optimum Combinations of Refrigerants. As pre- 
viously mentioned, the least mechanical complexity accompanies 
the least number of cycles used in series between two widely sepa- 
rated temperatures. Fig. 6 shows that a refrigerant selected with 
its optimum c.o.p. point near the desired condensing temperature 
will, in general, continue to give thermodynamic performance 
equivalent to other refrigerants at lower temperatures as its 
evaporator temperature is lowered indefinitely. In other words, 
if the critical temperature of a refrigerant exceeds the condensing 
temperature of the desired cycle, it will by itself give optimum 
thermodynamic performance, at the same time yielding least 
mechanical complexity. Unfortunately, however, decreasing 
saturation pressure and increasing vapor specific volume simul- 
taneously dictate a practical lower limit of application for any re- 




frigerant. In order to prevent operation at pressures less than 
atmospheric and at enormous values of specific volume requiring 
large apparatus volume and weight, it becomes necessary to con- 
struct multiple cycles, stacking them temperaturewise on top of 
each other as shown in Fig. 7. 


Fig. 7  Binary Vapor Cycle (temperature versus entropy, s, showing the saturation curves of refrigerants A and B)


A series of refrigerants, each of optimum performance in its 
own temperature range, is selected to comprise a multiple cycle, 
remembering that in Fig. 6 the condenser temperature for each 
refrigerant is 100 deg F higher than the evaporator temperature 
at which the c.o.p. is plotted. As an example, in constructing an 
optimum multiple cycle between temperatures of 1000 and 200 R, 
one might choose mercury, Freon 11, Freon 13, and Freon 14 as an 
optimum combination. In general, use of these same refrigerants 
in widely spaced cycles requiring but two or three of them in 
series also would be justified, but the exact selection definitely 
depends on the end temperatures required. In single cycles, for 
instance, there are other refrigerants besides the four mentioned 
which work more successfully in the range 400-550 R with small 
evaporator-condenser temperature spreads because their par- 
ticular optimum performance corresponds exactly to the tem- 
perature range considered. 

Calculation of C.O.P. for Cycles in Series. The Carnot c.o.p.
for a multiple cycle can be calculated by the same expression as 
given by Equation [2] since it is a function only of the end tem- 
peratures of the cycle, i.e., the highest condensing temperature 
and the lowest evaporating temperature. Besides the irreversibili- 
ties already present in the individual cycles, an additional irre- 
versibility associated with the transfer of heat across the finite 
temperature difference between cycles appears. Because of the 
boiling and condensing processes taking place in these intermediate 
condenser-evaporators, the temperature difference will be rela- 
tively small, in the vicinity of 10-20 F. Using a temperature 
overlap of this order of magnitude, the multiple cycle c.o.p. can 
be calculated. The definition of c.o.p. for multiple vapor cycles 
becomes 


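$$\text{c.o.p.}_{\text{multiple cycle}} = \frac{Q_{\text{evaporator}}\ \text{(lowest single cycle)}}{\text{Total compressor work (all cycles in series)}} \qquad [3]$$


For the binary cycle of Fig. 7 this expression becomes


$$\text{c.o.p.}_{\text{binary cycle}} = \frac{Q_{\text{evaporator}}\ (b)}{\text{compressor work}\ (a + b)} \qquad [4]$$

$$= \frac{\text{c.o.p.}_a \times \text{c.o.p.}_b}{\text{c.o.p.}_a + \text{c.o.p.}_b + 1} \qquad [5]$$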


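For a tertiary cycle, the expression becomes


$$\text{c.o.p.}_{\text{tertiary cycle}} = \frac{Q_{\text{(lowest evap. temp.)}}\ (c)}{\text{Total comp. work}\ (a + b + c)} \qquad [6]$$

$$= \frac{\text{c.o.p.}_a \times \text{c.o.p.}_b \times \text{c.o.p.}_c}{\text{c.o.p.}_a \times \text{c.o.p.}_b + \text{c.o.p.}_a \times \text{c.o.p.}_c + \text{c.o.p.}_b \times \text{c.o.p.}_c + \text{c.o.p.}_a + \text{c.o.p.}_b + \text{c.o.p.}_c + 1} \qquad [7]$$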


Since the complexity of the multiple cycle c.o.p. equation ob- 
viously increases rapidly with the number of single cycles in com- 
bination, a more desirable procedure for obtaining numerical 
values of higher-order multiple-vapor cycle c.o.p. follows: 


1 Determine the c.o.p. for each single cycle. 

2 Start from one end of the series; combine the first two 
cycles, evaluating the c.o.p. of these two cycles with the binary- 
cycle equation. 

3 Treating all previously considered cycles as a single cycle, 
continue to add single cycles using the binary-cycle equation 
repetitively. 


The entire process, carried out for n cycles in series, involves n
calculations of single c.o.p. and (n - 1) calculations of binary
c.o.p.
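
The repeated use of the binary-cycle equation can be sketched as follows. The sketch assumes that the heat rejected by each lower cycle (its evaporator load plus its compressor work) is exactly the evaporator load of the cycle above it, which gives a binary c.o.p. of (c.o.p.a × c.o.p.b)/(c.o.p.a + c.o.p.b + 1). The function names and the sample values 2, 3, and 4 are hypothetical.

```python
from functools import reduce


def binary_cop(cop_a, cop_b):
    """Combined c.o.p. of two cycles in series (binary-cycle equation)."""
    return cop_a * cop_b / (cop_a + cop_b + 1.0)


def series_cop(cops):
    """Fold the binary-cycle equation over a non-empty sequence of single-cycle c.o.p."""
    return reduce(binary_cop, cops)


def tertiary_cop(a, b, c):
    """Closed-form three-cycle expression, used here only as a cross-check."""
    return a * b * c / (a * b + a * c + b * c + a + b + c + 1.0)


stacked = series_cop([2.0, 3.0, 4.0])   # two applications of binary_cop
direct = tertiary_cop(2.0, 3.0, 4.0)    # single closed-form evaluation
```

For n cycles the fold makes n - 1 calls to `binary_cop`, matching the count given above, and the result does not depend on which end of the series is taken first.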

Optimum Intermediate Evaporator-Condenser Temperatures. 
Fig. 6 demonstrates that the c.o.p. of virtually all refrigerants 
used in 100 F vapor cycles with constant enthalpy expansion, no 
line losses, and 60 per cent reversible adiabatic compression, 
tends to follow the straight line 


$$\text{c.o.p.} = 5.15 \times 10^{-3}\, T_{\text{evap}} \quad (T_{\text{evap}}\ \text{in R}) \qquad [8]$$


or equivalently


$$\text{c.o.p.} = 0.515\, (\text{c.o.p.})_{\text{Carnot}}$$


in a region distant from the critical point. Some refrigerants,
however, appear to drop off in this range, so that their c.o.p. be-
comes less than that of the refrigerants in the next lower tempera-
ture range. The question then becomes: Can both of these
representations be correct? No definite conclusion can be reached
regarding this question, but since mercury is the chief offender in
this respect, it is suspected that all refrigerants do tend toward
the straight-line c.o.p. versus evaporator-temperature relation-
ship in the region far from the critical point. Two other re-
frigerants were found whose data showed extremely erratic per-
formance in Fig. 6; both nitrous oxide and butane were omitted
for this reason.7 Where the data are suspected of inaccuracy, the
lines have been shown dashed instead of solid in Fig. 6.

With the conclusion reached in the foregoing, there can be no 
optimum evaporator-condenser temperature between successive 
refrigerants in multiple cycles which results from considerations 
of thermodynamic performance alone. Instead, practical re- 
quirements limiting equipment size and pressure cause the de- 
signer to add another refrigerant to the series at or near the point 
where the saturation pressure of any refrigerant goes below at- 
mospheric. 
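
The two forms of the straight-line relation can be checked against each other numerically. The sketch assumes the 100 F evaporator-condenser spread used for Fig. 6, so that the Carnot c.o.p. is T_evap/100, and takes the factor 0.515 and the slope 5.15 × 10⁻³ per deg R from the text. The function names and sample temperatures are illustrative.

```python
import math


def carnot_cop(t_evap_r, t_cond_r):
    """Carnot refrigeration c.o.p. for absolute temperatures in deg R."""
    return t_evap_r / (t_cond_r - t_evap_r)


def straight_line_cop(t_evap_r):
    """Straight-line c.o.p. for a 100 F cycle far from the critical point."""
    return 5.15e-3 * t_evap_r


def fraction_of_carnot_cop(t_evap_r, spread_f=100.0, fraction=0.515):
    """Same estimate written as a fixed fraction of the Carnot c.o.p."""
    return fraction * carnot_cop(t_evap_r, t_evap_r + spread_f)


# The two forms agree only for the 100 F spread for which the line was drawn.
agree = all(
    math.isclose(straight_line_cop(t), fraction_of_carnot_cop(t))
    for t in (300.0, 500.0, 900.0)
)
```

For any other spread the fraction-of-Carnot form would have to be refitted, since the slope 5.15 × 10⁻³ already contains the 100 F difference.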


ACKNOWLEDGMENT 


The authors gratefully acknowledge the assistance of the 
General Electric Company in supporting the program from which 
this paper is drawn. 


REFERENCES 


1 “Refrigeration Fundamentals,” ASRE Data Book, seventh
edition, 1951.

2 “Properties of Commonly-Used Refrigerants,” ACRMA, 1948.

3 “Thermodynamic Properties of Steam,” by J. H. Keenan and
F. G. Keyes, John Wiley & Sons, Inc., New York, N. Y., 1936.

4 “Thermodynamic Properties of Mercury Vapor,” by L. A.
Sheldon, General Electric Technical Information Series Report DF
76054.

5 “Total Heat-Entropy Diagram for Mercury,” by L. A. Sheldon,
General Electric Company, April 15, 1948.


6 The data for mercury are known not to be highly accurate, but
probably the greater reason for the obvious discrepancy is the in- 
accurate large extrapolation made of the data. 

7 New revisions of data for some of the substances can be expected 
to improve the appearance of Fig. 6 in that a closer conformance to 
the straight line will probably be shown. 




This paper describes an analytical method for determin- 
ing velocities and accelerations in mechanisms by means 
of geometry of the complex plane. In addition to analysis 
of the four-bar linkage, a special solution is worked out for 
direct-contact mechanisms. The latter is included be- 
cause, though accelerations in direct-contact mecha- 
nisms can be analyzed in terms of their equivalent four-bar 
linkages, there are certain cases where it is explained that 
the four-bar-linkage analysis breaks down. Further, it is 
shown that the method is general and is applicable to 
any mechanism having plane motion including complex 
mechanisms which cannot be treated as a series of four- 
bar linkages. 


INTRODUCTION 


USE of the complex plane for velocity and acceleration
analysis in linkwork is not new. Beyer (1)4 discusses an
application of complex functions made by Bloch (2).
Bloch developed a procedure for determining the relative lengths
of the links for the four-bar linkage so that certain prescribed
angular velocity and acceleration ratios between driving and
driven crank would be satisfied. In a later paper, Rosenauer (3)
has shown how the relative velocity and acceleration components
occurring in various common linkages can be expressed in complex
form.

The main purpose of this paper is to present a technique for 
handling in complex form the relative velocities and accelerations 
so that ultimately the velocities and accelerations of each of the 
links in a mechanism can be related to those of the driving mem- 
ber. The technique is not limited to the four-bar linkage, but ap- 
plies also to complex linkages which cannot be analyzed as a series 
of four-bar linkages. 

Though graphical solutions by the relative velocity and ac- 
celeration method are quicker than obtaining numerical results by 
the equations which are developed here, mathematical analysis 
nevertheless is advantageous under certain circumstances as, for 
example, when greater accuracy is desired. This would be par- 
ticularly true for mechanisms in which the relative size of the 
links or instantaneous positions of the links is such that graphical 


1 From a dissertation submitted by the senior author in partial ful- 
fillment of the requirements for the degree of Doctor of Philosophy, 
Northwestern University. 

2 Associate Professor of Mechanical Engineering, Michigan State 
University, East Lansing, Mich. Mem. ASME. 

3 Professor of Mechanical Engineering, Technological Institute, 
Northwestern University, Evanston, Ill. Mem. ASME. 

4 Numbers in parentheses refer to the Bibliography at the end of 
the paper. 

Contributed by the Machine Design Division and presented at the
Semi-Annual Meeting, Cleveland, Ohio, June 17-21, 1956, of THE
AMERICAN SOCIETY OF MECHANICAL ENGINEERS.

NOTE: Statements and opinions advanced in papers are to be
understood as individual expressions of their authors and not those
of the Society. Manuscript received at ASME Headquarters, March
1, 1956. Paper No. 56-SA-32.


An Application of Complex Geometry to
Relative Velocities and Accelerations
in Mechanisms


By G. H. MARTIN2 AND M. F. SPOTTS3


results would be subject to large error because of the difficulty in 
accurately determining points of intersection on a drawing; for 
example, the point of intersection of nearly parallel lines. Further, 
mathematical analysis can be of value from the standpoint of 
systematic design. 

The computation of the angular velocities and accelerations as 
well as the linear velocity and acceleration of any point on the 
linkage can be systematized and the substitution of numerical 
data into the equations developed herein can be carried out in 
tabular form. This would speed up the work if results were de- 
sired for a number of positions of the driving member. Solutions 
by tabulation can be carried out with the aid of a desk calculat- 
ing machine by a person unskilled in kinematics once the form of 
the table is set up. 


Fig. 1  (position vectors OA and AB and resultant R in the xy-plane)


In Fig. 1, points A and B are moving in the xy-plane relative to
the fixed origin O. The position of A relative to O is specified
by modulus $r_2$ and argument $\theta_2$; the position of B relative to A is
specified by means of modulus $r_3$ and argument $\theta_3$. Vectors OA
and AB are considered positive when directed as shown; the
counterclockwise direction is positive for angles $\theta_2$ and $\theta_3$. In
general $r_2$, $r_3$, $\theta_2$, and $\theta_3$ are all functions of time.

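Resultant position vector $\mathbf{R}$ can be expressed as

$$\mathbf{R} = \mathbf{OA} +\!\!\to \mathbf{AB} = r_2 e^{i\theta_2} + r_3 e^{i\theta_3} \qquad [1]$$

Should a third point C be present in Fig. 1, Equation [1] would
be written with an additional term $r_4 e^{i\theta_4}$ to express the vector BC.
The exponentials in Equation [1] can be expanded and the real
parts collected together and the imaginary parts collected together.
Writing $\mathcal{R}(\mathbf{R}) = r_2\cos\theta_2 + r_3\cos\theta_3$ for the real part and
$\mathcal{I}(\mathbf{R}) = r_2\sin\theta_2 + r_3\sin\theta_3$ for the imaginary part, the modulus
and argument for $\mathbf{R}$ are then

$$\operatorname{mod}\mathbf{R} = \sqrt{[\mathcal{R}(\mathbf{R})]^2 + [\mathcal{I}(\mathbf{R})]^2} \qquad [2]$$

$$\arg\mathbf{R} = \tan^{-1}\frac{\mathcal{I}(\mathbf{R})}{\mathcal{R}(\mathbf{R})} \qquad [3]$$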


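Velocity vector $\dot{\mathbf{R}}$ for point B is obtained by differentiation of
Equation [1] with respect to time

$$\dot{\mathbf{R}} = \dot r_2 e^{i\theta_2} + i r_2\dot\theta_2 e^{i\theta_2} + \dot r_3 e^{i\theta_3} + i r_3\dot\theta_3 e^{i\theta_3} \qquad [4]$$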




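This equation also can be written in the following manner

$$\dot{\mathbf{R}} = \mathbf{V}_a^{\,r} +\!\!\to \mathbf{V}_a^{\,t} +\!\!\to \mathbf{V}_{b/a}^{\,r} +\!\!\to \mathbf{V}_{b/a}^{\,t}$$

where

$\mathbf{V}_a^{\,r} = \dot r_2 e^{i\theta_2}$ = absolute velocity of A in radial direction
$\mathbf{V}_a^{\,t} = i r_2\dot\theta_2 e^{i\theta_2}$ = absolute velocity of A in transverse direction
$\mathbf{V}_{b/a}^{\,r} = \dot r_3 e^{i\theta_3}$ = velocity of B relative to A in radial direction
$\mathbf{V}_{b/a}^{\,t} = i r_3\dot\theta_3 e^{i\theta_3}$ = velocity of B relative to A in transverse direction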


These components, properly directed for positive quantities,
are shown in Fig. 2. It should be noted in Equation [4] that a
vector lies along the corresponding $r$ except when the term is
preceded by an $i$, in which case the vector is rotated 90 deg
counterclockwise from $r$.
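Equations [1] through [4] can be checked numerically. The following is a minimal sketch, not the authors' tabular procedure: the numerical state is hypothetical, and the argument of Equation [3] is taken with `cmath.phase` (a four-quadrant arctangent) rather than a plain arctangent.

```python
import cmath

def position(r2, th2, r3, th3):
    # Equation [1]: R = r2*e^(i*th2) + r3*e^(i*th3)
    return r2 * cmath.exp(1j * th2) + r3 * cmath.exp(1j * th3)

def velocity(r2, th2, r3, th3, r2d, th2d, r3d, th3d):
    # Equation [4]: radial and transverse terms for OA, then for AB
    e2, e3 = cmath.exp(1j * th2), cmath.exp(1j * th3)
    return r2d * e2 + 1j * r2 * th2d * e2 + r3d * e3 + 1j * r3 * th3d * e3

# Hypothetical instantaneous state: lengths, angles (rad), and their rates
r2, th2, r3, th3 = 2.0, 0.7, 1.5, 2.1
r2d, th2d, r3d, th3d = 0.3, 1.2, -0.4, -0.8

R = position(r2, th2, r3, th3)
V = velocity(r2, th2, r3, th3, r2d, th2d, r3d, th3d)
mod_R, arg_R = abs(R), cmath.phase(R)  # Equations [2] and [3]

# Central-difference check of Equation [4] over a small time step h
h = 1e-6
R_fwd = position(r2 + h * r2d, th2 + h * th2d, r3 + h * r3d, th3 + h * th3d)
R_bwd = position(r2 - h * r2d, th2 - h * th2d, r3 - h * r3d, th3 - h * th3d)
V_fd = (R_fwd - R_bwd) / (2 * h)
```

The finite-difference velocity `V_fd` should agree with `V` to within the truncation error of the step.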


Fig. 3 (acceleration components; $A_4$ and $A_8$ are the Coriolis components)


We can expand Equation [4] in terms of its real and imaginary
parts. The modulus and argument of $\dot{\mathbf{R}}$ can then be expressed
similar to Equations [2] and [3] except that $\mathbf{R}$ must be replaced by
$\dot{\mathbf{R}}$.

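Acceleration vector $\ddot{\mathbf{R}}$ for point B is found by differentiation of
Equation [4] with respect to time

$$\ddot{\mathbf{R}} = \ddot r_2 e^{i\theta_2} - r_2\dot\theta_2^2 e^{i\theta_2} + i r_2\ddot\theta_2 e^{i\theta_2} + 2i\dot r_2\dot\theta_2 e^{i\theta_2} + \ddot r_3 e^{i\theta_3} - r_3\dot\theta_3^2 e^{i\theta_3} + i r_3\ddot\theta_3 e^{i\theta_3} + 2i\dot r_3\dot\theta_3 e^{i\theta_3} \qquad [5]$$

$$\ddot{\mathbf{R}} = A_1 +\!\!\to A_2 +\!\!\to A_3 +\!\!\to A_4\ (\text{Coriolis}) +\!\!\to A_5 +\!\!\to A_6 +\!\!\to A_7 +\!\!\to A_8\ (\text{Coriolis}) \qquad [5a]$$

where

$A_1 +\!\!\to A_2 = \mathbf{A}_a^{\,r}$ = absolute acceleration of A in radial direction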


TRANSACTIONS OF THE ASME 


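$A_3 +\!\!\to A_4 = \mathbf{A}_a^{\,t}$ = absolute acceleration of A in transverse direction
$A_5 +\!\!\to A_6 = \mathbf{A}_{b/a}^{\,r}$ = acceleration of B relative to A in radial direction
$A_7 +\!\!\to A_8 = \mathbf{A}_{b/a}^{\,t}$ = acceleration of B relative to A in transverse direction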


These eight components are represented in Fig. 3. It should be
noted, because of the minus signs, that vectors $A_2$ and $A_6$ are
directed in the opposite sense to vectors OA and AB.

From the real and imaginary parts of Equations [5] we can obtain
expressions for the modulus and argument of $\ddot{\mathbf{R}}$. They are
similar to Equations [2] and [3] except that $\mathbf{R}$ is replaced by $\ddot{\mathbf{R}}$.
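The eight components of Equation [5a] can be sketched in code as follows. This is an illustrative check only: the motion law in `R_of_t` is a hypothetical one chosen so that the rates at $t = 0$ are known exactly, and the sum of the components is compared with a second central difference of the position vector.

```python
import cmath

def components(r, th, rd, thd, rdd, thdd):
    # For one vector r*e^(i*th): [radial r'' term, centripetal term,
    # transverse (angular acceleration) term, Coriolis term]
    e = cmath.exp(1j * th)
    return [rdd * e, -r * thd ** 2 * e, 1j * r * thdd * e, 2j * rd * thd * e]

def R_of_t(t):
    # Hypothetical motion law, used only to verify the formulas at t = 0
    r2, th2 = 2.0 + 0.3 * t + 0.1 * t * t, 0.7 + 1.2 * t - 0.25 * t * t
    r3, th3 = 1.5 - 0.4 * t, 2.1 - 0.8 * t + 0.3 * t * t
    return r2 * cmath.exp(1j * th2) + r3 * cmath.exp(1j * th3)

# A1..A4 belong to vector OA, A5..A8 to vector AB (values and rates at t = 0)
A = components(2.0, 0.7, 0.3, 1.2, 0.2, -0.5) + components(1.5, 2.1, -0.4, -0.8, 0.0, 0.6)
R_ddot = sum(A)

# Second central difference of the position vector
h = 1e-4
R_ddot_fd = (R_of_t(h) - 2 * R_of_t(0.0) + R_of_t(-h)) / h ** 2
```

Entries `A[3]` and `A[7]` are the Coriolis components; `A[1]` and `A[5]` carry the minus signs mentioned above.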

Since many mechanisms can be reduced to an equivalent four- 
bar linkage or a combination of such linkages, the preceding equa- 
tions are first applied to this basic type. 


FOUR-BAR LINKAGE


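Fig. 4

In Fig. 4 lengths $r_2$, $r_3$, and $r_4$ as well as the angles $\theta_2$, $\theta_3$, and
$\theta_4$ are assumed to be known. The angular velocity $\dot\theta_2$ and angular
acceleration $\ddot\theta_2$ are also known. Since $r_2$, $r_3$, and $r_4$ are constants,
$\dot r_2 = \dot r_3 = \dot r_4 = 0$. Also, since the velocity of C relative to O is
zero

$$\mathbf{V}_{c/o} = \mathbf{V}_{a/o} +\!\!\to \mathbf{V}_{b/a} +\!\!\to \mathbf{V}_{c/b} = 0$$

This may be written in the form of Equation [4] and can be
expanded in terms of the real and imaginary parts as follows

$$\mathcal{R}(\mathbf{V}_{c/o}) = -r_2\dot\theta_2\sin\theta_2 - r_3\dot\theta_3\sin\theta_3 - r_4\dot\theta_4\sin\theta_4 = 0$$
$$\mathcal{I}(\mathbf{V}_{c/o}) = r_2\dot\theta_2\cos\theta_2 + r_3\dot\theta_3\cos\theta_3 + r_4\dot\theta_4\cos\theta_4 = 0 \qquad [6]$$

Equations [6] can be solved for the unknowns $\dot\theta_3$ and $\dot\theta_4$

$$\dot\theta_3 = -\dot\theta_2\,\frac{r_2\sin\alpha}{r_3\sin\beta} \qquad [7]$$

$$\dot\theta_4 = \dot\theta_2\,\frac{r_2\sin\gamma}{r_4\sin\beta} \qquad [8]$$

where $\alpha = \theta_2 - \theta_4$, $\beta = \theta_3 - \theta_4$, and $\gamma = \theta_2 - \theta_3$.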
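As an illustration, Equations [7] and [8] can be evaluated and substituted back into the velocity loop equation. The sketch below assumes the sign convention $\alpha = \theta_2 - \theta_4$, $\beta = \theta_3 - \theta_4$, $\gamma = \theta_2 - \theta_3$; the linkage data are hypothetical.

```python
import cmath
import math

def four_bar_velocities(r2, r3, r4, th2, th3, th4, th2d):
    """Equations [7] and [8]: angular velocities of links 3 and 4."""
    alpha, beta, gamma = th2 - th4, th3 - th4, th2 - th3
    th3d = -th2d * r2 * math.sin(alpha) / (r3 * math.sin(beta))
    th4d = th2d * r2 * math.sin(gamma) / (r4 * math.sin(beta))
    return th3d, th4d

# Hypothetical data: lengths in any consistent unit, angles in rad, rad/sec
r2, r3, r4 = 1.5, 4.0, 3.0
th2, th3, th4 = 1.0, 0.35, -1.9
th2d = 3.0

th3d, th4d = four_bar_velocities(r2, r3, r4, th2, th3, th4, th2d)

# The sum of the relative velocities from O to C should vanish
residual = sum(1j * r * w * cmath.exp(1j * th)
               for r, w, th in ((r2, th2d, th2), (r3, th3d, th3), (r4, th4d, th4)))
```

The formulas break down when $\sin\beta = 0$, that is, when links 3 and 4 are collinear.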
The velocity for any point P on link 3 may be written in the 
form of Equation [4] 


$$\mathbf{V}_p = \mathbf{V}_{a/o} +\!\!\to \mathbf{V}_{p/a} = i r_2\dot\theta_2 e^{i\theta_2} + i p\,\dot\theta_3 e^{i\psi}$$

Fig. 2


APRIL, 1957 
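where $p = AP$, $\psi$ is the argument of vector AP, and $\dot\psi = \dot\theta_3$. Thus

$$\mathcal{R}(\mathbf{V}_p) = -r_2\dot\theta_2\sin\theta_2 - p\,\dot\theta_3\sin\psi$$
$$\mathcal{I}(\mathbf{V}_p) = r_2\dot\theta_2\cos\theta_2 + p\,\dot\theta_3\cos\psi \qquad [9]$$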


The magnitude and direction of $\mathbf{V}_p$ are then found from Equations
[2] and [3] by substituting $\mathbf{V}_p$ for $\mathbf{R}$.

The sum of the relative accelerations from O to C must equal 
zero. Thus 


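$$\mathbf{A}_{a/o}^{\,n} +\!\!\to \mathbf{A}_{a/o}^{\,t} +\!\!\to \mathbf{A}_{b/a}^{\,n} +\!\!\to \mathbf{A}_{b/a}^{\,t} +\!\!\to \mathbf{A}_{c/b}^{\,n} +\!\!\to \mathbf{A}_{c/b}^{\,t} = 0$$

This can be written in the form of Equation [5]. The terms
containing $\dot r$ and $\ddot r$ quantities are equal to zero and the real and
imaginary parts of the resulting equation become

$$\mathcal{R}(\mathbf{A}_{c/o}) = -r_2\dot\theta_2^2\cos\theta_2 - r_2\ddot\theta_2\sin\theta_2 - r_3\dot\theta_3^2\cos\theta_3 - r_3\ddot\theta_3\sin\theta_3 - r_4\dot\theta_4^2\cos\theta_4 - r_4\ddot\theta_4\sin\theta_4 = 0$$
$$\mathcal{I}(\mathbf{A}_{c/o}) = -r_2\dot\theta_2^2\sin\theta_2 + r_2\ddot\theta_2\cos\theta_2 - r_3\dot\theta_3^2\sin\theta_3 + r_3\ddot\theta_3\cos\theta_3 - r_4\dot\theta_4^2\sin\theta_4 + r_4\ddot\theta_4\cos\theta_4 = 0 \qquad [10]$$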


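Solving Equations [10] for $\ddot\theta_3$ and $\ddot\theta_4$, and substituting Equations
[7] and [8], we have

$$\ddot\theta_3 = \frac{\dot\theta_3}{\dot\theta_2}\,\ddot\theta_2 - \frac{r_2\dot\theta_2^2\cos\alpha + r_3\dot\theta_3^2\cos\beta + r_4\dot\theta_4^2}{r_3\sin\beta} \qquad [11]$$

$$\ddot\theta_4 = \frac{\dot\theta_4}{\dot\theta_2}\,\ddot\theta_2 + \frac{r_2\dot\theta_2^2\cos\gamma + r_3\dot\theta_3^2 + r_4\dot\theta_4^2\cos\beta}{r_4\sin\beta} \qquad [12]$$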
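The angular accelerations of Equations [11] and [12] can be sketched in the same way and checked against the acceleration loop equation [10]. The data are hypothetical, and the angle definitions $\alpha = \theta_2 - \theta_4$, $\beta = \theta_3 - \theta_4$, $\gamma = \theta_2 - \theta_3$ are assumed.

```python
import cmath
import math

def four_bar_accelerations(r2, r3, r4, th2, th3, th4, th2d, th2dd):
    """Equations [7], [8], [11], [12] for a four-bar linkage (th2d != 0)."""
    alpha, beta, gamma = th2 - th4, th3 - th4, th2 - th3
    sb = math.sin(beta)
    th3d = -th2d * r2 * math.sin(alpha) / (r3 * sb)
    th4d = th2d * r2 * math.sin(gamma) / (r4 * sb)
    th3dd = th2dd * th3d / th2d - (
        r2 * th2d ** 2 * math.cos(alpha) + r3 * th3d ** 2 * math.cos(beta) + r4 * th4d ** 2
    ) / (r3 * sb)
    th4dd = th2dd * th4d / th2d + (
        r2 * th2d ** 2 * math.cos(gamma) + r3 * th3d ** 2 + r4 * th4d ** 2 * math.cos(beta)
    ) / (r4 * sb)
    return th3d, th4d, th3dd, th4dd

# Hypothetical data
r2, r3, r4 = 1.5, 4.0, 3.0
th2, th3, th4 = 1.0, 0.35, -1.9
th2d, th2dd = 3.0, -1.5

th3d, th4d, th3dd, th4dd = four_bar_accelerations(r2, r3, r4, th2, th3, th4, th2d, th2dd)

# Sum of the normal and tangential relative accelerations from O to C
links = ((r2, th2, th2d, th2dd), (r3, th3, th3d, th3dd), (r4, th4, th4d, th4dd))
residual = sum((-r * w ** 2 + 1j * r * a) * cmath.exp(1j * th) for r, th, w, a in links)
```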


The linear acceleration of any point on the linkage may be found 
from a summation of the relative linear accelerations. For ex- 
ample, the acceleration of point P in Fig. 4 may be written in the 
form of Equation [5] 


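$$\mathbf{A}_p = \mathbf{A}_{a/o}^{\,n} +\!\!\to \mathbf{A}_{a/o}^{\,t} +\!\!\to \mathbf{A}_{p/a}^{\,n} +\!\!\to \mathbf{A}_{p/a}^{\,t}$$
$$\mathbf{A}_p = -r_2\dot\theta_2^2 e^{i\theta_2} + i r_2\ddot\theta_2 e^{i\theta_2} - p\,\dot\theta_3^2 e^{i\psi} + i p\,\ddot\theta_3 e^{i\psi}$$

$$\mathcal{R}(\mathbf{A}_p) = -r_2(\dot\theta_2^2\cos\theta_2 + \ddot\theta_2\sin\theta_2) - p(\dot\theta_3^2\cos\psi + \ddot\theta_3\sin\psi)$$
$$\mathcal{I}(\mathbf{A}_p) = -r_2(\dot\theta_2^2\sin\theta_2 - \ddot\theta_2\cos\theta_2) - p(\dot\theta_3^2\sin\psi - \ddot\theta_3\cos\psi) \qquad [13]$$

The magnitude and direction of $\mathbf{A}_p$ are then found from Equations
[2] and [3] by substituting $\mathbf{A}_p$ for $\mathbf{R}$.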
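A short sketch of the coupler-point computation follows. It evaluates the real and imaginary parts in the forms of Equations [9] and [13] and compares them with the exponential forms; the numerical inputs, including the angular velocity and acceleration of link 3, are hypothetical and would normally come from Equations [7] and [11].

```python
import cmath
import math

def point_p(r2, th2, th2d, th2dd, p, psi, th3d, th3dd):
    """Velocity and acceleration of coupler point P from real/imaginary parts."""
    s2, c2, sp, cp = math.sin(th2), math.cos(th2), math.sin(psi), math.cos(psi)
    v = complex(-r2 * th2d * s2 - p * th3d * sp,
                r2 * th2d * c2 + p * th3d * cp)                      # Equation [9]
    a = complex(-r2 * (th2d ** 2 * c2 + th2dd * s2) - p * (th3d ** 2 * cp + th3dd * sp),
                -r2 * (th2d ** 2 * s2 - th2dd * c2) - p * (th3d ** 2 * sp - th3dd * cp))  # [13]
    return v, a

# Hypothetical inputs (psi is the argument of AP; P rides on link 3)
r2, th2, th2d, th2dd = 1.5, 1.0, 3.0, -1.5
p, psi, th3d, th3dd = 2.0, 0.6, -1.1, 0.4

V_p, A_p = point_p(r2, th2, th2d, th2dd, p, psi, th3d, th3dd)

# Exponential forms for comparison
e2, ep = cmath.exp(1j * th2), cmath.exp(1j * psi)
V_exp = 1j * r2 * th2d * e2 + 1j * p * th3d * ep
A_exp = (-r2 * th2d ** 2 + 1j * r2 * th2dd) * e2 + (-p * th3d ** 2 + 1j * p * th3dd) * ep

speed, direction = abs(V_p), cmath.phase(V_p)  # Equations [2] and [3] applied to V_p
```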

The slider-crank mechanism in Fig. 5 is a special case of the
four-bar linkage; i.e.

$$\theta_4 = \text{const} = \tfrac{3}{2}\pi, \quad r_4 = \infty, \quad \text{and thus} \quad \dot\theta_4 = \ddot\theta_4 = 0$$

Equations [7] and [11] give the angular velocity and acceleration,
respectively, of the connecting rod.

There are certain direct-contact mechanisms in which one or 
two of the links in the equivalent four-bar linkage become infinite 
in length, and then the angular accelerations become indeter- 
minate by Equations [11] and [12]. Examples are a disk cam with 
a flat-faced oscillating follower, and a disk cam having a re- 
ciprocating follower whose face is either curved or flat. Graphical 
analysis using the method of relative accelerations likewise is 
known to be inapplicable to the equivalent four-bar linkage in 
such cases. Graphical analysis is accomplished, however, by a 
consideration of the relative accelerations between the points in 
contact on cam and follower. The mathematical analysis pre- 
sented in the following section likewise involves the relative ac- 
celerations for the contacting points, and the equations de- 
veloped are applicable to any direct-contact mechanism. 


DIRECT-CONTACT MECHANISMS


Bodies 2 and 4 in Fig. 6 have arbitrary curves, and it is assumed
that at all points their radii of curvature are known. A
and B are points on 2 and 4, respectively, which are coincident at
the instant considered. The lengths OA and BD together with
$\theta_2$, $\theta_4$, and $\theta_T$ are assumed to be known. Either $\dot\theta_2$ and $\ddot\theta_2$ are known
and $\dot\theta_4$ and $\ddot\theta_4$ are to be determined, or $\dot\theta_4$ and $\ddot\theta_4$ are known and $\dot\theta_2$
and $\ddot\theta_2$ are to be found.


Solution to the problem can be effected by considering point A
as being temporarily fixed on link 2, with position vector AB of
length $r_3 = 0$ lying along the tangent. Angle $\theta_3$ is measured from
the x-axis and terminates with the position vector AB. The general
layout is similar to that of Fig. 1. With these assumptions
$\dot r_2 = \dot r_4 = 0$, and Equation [4] as applied to Fig. 6 becomes

$$i r_4\dot\theta_4 e^{i\theta_4} = i r_2\dot\theta_2 e^{i\theta_2} + \dot r_3 e^{i\theta_T} \qquad [14]$$

Expanding Equation [14] in terms of its real and imaginary parts

$$-r_4\dot\theta_4\sin\theta_4 + r_2\dot\theta_2\sin\theta_2 - \dot r_3\cos\theta_T = 0$$
$$r_4\dot\theta_4\cos\theta_4 - r_2\dot\theta_2\cos\theta_2 - \dot r_3\sin\theta_T = 0 \qquad [16]$$


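Elimination of $\dot r_3$ from Equations [16] gives

$$\dot\theta_4 = \frac{r_2\cos(\theta_2 - \theta_T)}{r_4\cos(\theta_4 - \theta_T)}\,\dot\theta_2 \qquad [17]$$

Next, elimination of $\dot\theta_4$ from Equations [16] gives

$$\dot r_3 = \frac{r_2\dot\theta_2\sin(\theta_2 - \theta_4)}{\cos(\theta_T - \theta_4)} \qquad [18]$$

Fig. 5

Fig. 6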

Although the modulus of position vector AB in Fig. 6 is zero,
the angular velocity of this vector relative to the tangent will in
general not equal zero. Consider Fig. 7; $r_{b2}$ is the radius of
curvature at point A of the path which point B describes on body 2.
Since A and B are coincident, angle $\theta_3$ is equal to $\theta_T$. However, B
moves to B' in time interval $dt$ with the chord length BB' equal to
$dr_3$. The corresponding change in $\theta_3$ is $d\theta_3$. Hence in Fig. 7

$$d\theta_3 = \frac{dr_3}{2 r_{b2}}$$

This equation can be divided by $dt$ and rearranged as

$$\dot\theta_3 = \frac{\dot r_3}{2 r_{b2}} \qquad [19]$$

In Fig. 7, $dr_3 = BB'$ is positive in sign and thus $\dot r_3 = (dr_3)/(dt)$
is positive. $d\theta_3$ is positive because it is measured counterclockwise
from the tangent and thus $\dot\theta_3 = (d\theta_3)/(dt)$ is positive. Then for
agreement in signs in Equation [19] $r_{b2}$ must be considered positive
when the center of curvature lies to the left of point B as
shown.
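The factor of 2 in the chord relation can be illustrated with a small numerical sketch. It assumes, for illustration only, that the relative path is locally a circle of radius `rho` tangent to the x-axis at the origin; the chord from the origin to a point a small arc length away then makes an angle of half the arc angle with the tangent.

```python
import math

def chord_from_tangent(s, rho):
    """Chord angle and chord length after travelling arc length s on a circle
    of radius rho that is tangent to the x-axis at the origin."""
    x = rho * math.sin(s / rho)
    y = rho * (1.0 - math.cos(s / rho))
    return math.atan2(y, x), math.hypot(x, y)

rho = 1.53  # hypothetical radius of curvature
rows = []
for s in (0.2, 0.02, 0.002):
    angle, chord = chord_from_tangent(s, rho)
    # (arc length, chord angle, s / (2*rho), chord length)
    rows.append((s, angle, s / (2.0 * rho), chord))
```

For small steps the chord length approaches the arc length, so the chord angle approaches $dr_3/(2 r_{b2})$.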


Fig. 7 


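The angular velocity of $r_3$ is then equal to $\dot\theta_2$ plus $\dot\theta_3$ with the
latter determined by Equation [19]. Our general equation for
acceleration, Equation [5], then becomes

$$-r_4\dot\theta_4^2 e^{i\theta_4} + i r_4\ddot\theta_4 e^{i\theta_4} = -r_2\dot\theta_2^2 e^{i\theta_2} + i r_2\ddot\theta_2 e^{i\theta_2} + \ddot r_3 e^{i\theta_T} + 2\dot r_3\left(\dot\theta_2 + \frac{\dot r_3}{2 r_{b2}}\right) i e^{i\theta_T} \qquad [20]$$

in which the last two terms represent
$\mathbf{A}_{b/a}^{\,r} +\!\!\to \mathbf{A}_{b/a}^{\,t} = A_5 +\!\!\to A_6 +\!\!\to A_7 +\!\!\to A_8$.

Expanding Equation [20] in terms of its real and imaginary parts,
we obtain a pair of equations similar to Equations [16]. Solving
them for $\ddot\theta_4$ and substituting the relationship expressed by
Equation [17] we find

$$\ddot\theta_4 = \frac{\dot\theta_4}{\dot\theta_2}\,\ddot\theta_2 + \frac{1}{r_4\cos(\theta_4 - \theta_T)}\left[r_4\dot\theta_4^2\sin(\theta_4 - \theta_T) - r_2\dot\theta_2^2\sin(\theta_2 - \theta_T) + 2\dot r_3\left(\dot\theta_2 + \frac{\dot r_3}{2 r_{b2}}\right)\right] \qquad [21]$$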


The Euler-Savary Equation may be used for determining $r$.
de Jonge (4) shows how this equation is easily derived using
Hartmann's (5) construction for the center of curvature of the path of
any point of a moving system. For Fig. 6, this equation becomes


(22) 


where 


BP and P is the instant center for bodies 2 and 4. P lies 
at the intersection of the normal with line DO 
= PG and G is the center of curvature of the path which 
point B of body 4 describes on body 2 
' = FP where F is the center of curvature of body 4 at point 
B 
PE where E is the center of curvature of body 2 at point 
A 
Then 


$$r_{b2} = r' + r \qquad [23]$$


Each of the quantities in Equations [22] and [23] is positive if it 
extends from 2 toward 4 along the normal. 


COMPLEX MECHANISMS


The linkage in Fig. 8 is an example of a complex mechanism. 
The determination of the velocities or accelerations in a complex 
linkage consists of starting at some point of known velocity or 
acceleration, usually a point where these quantities are zero, and 
then traversing successive links summing up the relative velocity 
or acceleration components along the way in exponential form 
until another point of known velocity or acceleration is reached. 
The process is repeated until all the links in the mechanism have 
been traversed. As was done earlier, each vector equation is 
written in terms of its real and imaginary parts and thus pro- 
vides two independent algebraic equations containing various 
unknown quantities. The foregoing procedure, in general, will 
provide a sufficient number of equations to permit a solution for 
the unknowns. 
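The step of splitting one vector equation into two real equations can be sketched for a single loop. The helper `solve_two_real_unknowns` below is a hypothetical name, not something from the paper; it solves $a x + b y = c$ for two real unknowns with complex coefficients, which is the same as solving the real-part and imaginary-part equations simultaneously. It is applied here to a four-bar velocity loop with hypothetical data.

```python
import cmath
import math

def solve_two_real_unknowns(a, b, c):
    """Solve a*x + b*y = c for real x and y, where a, b, c are complex.
    Multiplying by conj(b) and taking imaginary parts removes y, and
    multiplying by conj(a) removes x."""
    x = (c * b.conjugate()).imag / (a * b.conjugate()).imag
    y = (c * a.conjugate()).imag / (b * a.conjugate()).imag
    return x, y

# Hypothetical four-bar data; unknowns are the angular velocities of links 3 and 4
r2, r3, r4 = 1.5, 4.0, 3.0
th2, th3, th4 = 1.0, 0.35, -1.9
th2d = 3.0

a = 1j * r3 * cmath.exp(1j * th3)           # coefficient of th3d
b = 1j * r4 * cmath.exp(1j * th4)           # coefficient of th4d
c = -1j * r2 * th2d * cmath.exp(1j * th2)   # known term moved to the right side
th3d, th4d = solve_two_real_unknowns(a, b, c)

# Closed-form value of th4d for the same loop, for comparison
th4d_closed = th2d * r2 * math.sin(th2 - th3) / (r4 * math.sin(th3 - th4))
```

A mechanism with several loops gives a larger set of simultaneous linear equations of the same kind, which can be handled by any standard elimination routine.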

Fig. 8

As an illustration of the procedure consider Fig. 8. It is assumed
that $\theta_2$, $\theta_3$, $\theta_4$, $\theta_5$, $\theta_6$, $\dot\theta_2$, and $\ddot\theta_2$ are known. The angular
velocities and accelerations of all the links are to be found. Let
$r_2 = BF$, $r_3 = BD$, $r_4 = DE$, $r_5 = EG$, $r_6 = HC$, and $r_7 = CE$.
Velocities. Since points F, G, and H are points of zero velocity 
Vijn +—> +> Ver = 0 


+ + ir;b,e°% = 0... .. [24] 


+-> Var Vesa V ole = 0 


[25] 


The sums of the real and imaginary parts of Equations [24] and 
[25] then give 


sin 0, — sin 0, — sin 0, = 0 
Os cos 6. + 0, cos 6, + rss cos 6, = 0 
robe sin 


rob. cos 0. + 730; cos 0, + Oy cos 6, + rss cos 6, = 0 


sin 0; — sin 0, — sin 0, = O 


After the numerical values of the known quantities are substituted
into this set of equations, the latter can be solved for the
values of $\dot\theta_3$, $\dot\theta_4$, $\dot\theta_5$, and $\dot\theta_6$.

Accelerations. Points F, G, and H have zero acceleration; thus 


A,’ = 0 
or 
% 


% + rsOsie'™ = 0. [26] 


Also 
Ay? +— Aaj’ Aap! 
+-- A./a' +-— +-> = 0 


or 


+ = 0. . [27] 


+ 
Next, the sums of real and imaginary parts of Equations [26] 


and [27] are 


cos 6, — sin 6, cos 6, 
sin 6, 13052 cos 6; amen sin 6; =0 


sin 0; + cos 0, — 776.2 sin 
cos 6, sin 6; + cos 6; = 0 


cos — robe sin 0. — cos 0; — 736; sin 0; 
— cos 0, — sin 0, — cos 0, — ri; sin = 


sin + cos 02 — 1365? sin 0; + 136; cos 0; 
— 62 sin 0, + 8, cos 0, — rb,? sin 0; + 78, cos 0, = 0 
The numerical values of all known quantities are then substituted
into this set of equations. The latter can then be solved
for the numerical values of $\ddot\theta_3$, $\ddot\theta_4$, $\ddot\theta_5$, and $\ddot\theta_6$.


Fig. 9


Example. For the cam in Fig. 9, $R_1 = 3\tfrac{1}{2}$ in., $R_2 = 9\tfrac{7}{8}$ in.,
$R_3 = \tfrac{1}{2}$ in., $\dot\theta_2 = 3$ rad/sec, and $\ddot\theta_2 = -1.5$ rad/sec². The
angular velocity and acceleration of link 4 are to be found for y = 35
deg.

Solution. From the trigonometry of the figure, $r_2$ = 4.20 in. =
0.35 ft; $r'$ = BP = 4.40 in. = 0.367 ft; $r_4$ = 17.0 in. = 1.42 ft;
$\theta_T$ = 19 deg = 0.332 rad; $\theta_2$ = 83.2 deg = 1.45 rad; $\theta_4$ = 14 deg
= 0.244 rad. From Equation [17]

$$\dot\theta_4 = \frac{0.35\cos(1.45 - 0.332)\,3}{1.42\cos(0.244 - 0.332)} = 0.326\ \text{rad/sec}$$

and from Equation [18]

$$\dot r_3 = \frac{0.35(3)\sin(1.45 - 0.244)}{\cos(0.332 - 0.244)} = 0.983\ \text{fps}$$

Substituting into Equation [22] and solving for $r$ we have

$$r = 1.88\ \text{ft}$$

Then from Equation [23]

$$r_{b2} = -0.367 + 1.88 = 1.53\ \text{ft}$$

Equation [21] gives

$$\ddot\theta_4 = \frac{0.326}{3}(-1.5) + \frac{1}{1.42\cos(0.244 - 0.332)}\left[1.42(0.326)^2\sin(0.244 - 0.332) - 0.35(9)\sin(1.45 - 0.332) + 2(0.983)\left(3 + \frac{0.983}{2(1.53)}\right)\right] = 2.46\ \text{rad/sec}^2$$
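The arithmetic of this example can be reproduced with a short script. The sketch below is an illustration under stated assumptions: it takes $r_{b2} = 1.53$ ft from the paper as given (the Euler-Savary step is not recomputed), uses the chord relation $\dot\theta_3 = \dot r_3/(2 r_{b2})$, and then recovers $\ddot r_3$ to confirm that the complex acceleration equation balances.

```python
import cmath
import math

# Data from the example (ft, rad, rad/sec, rad/sec^2); rb2 is taken as given
r2, r4, rb2 = 0.35, 1.42, 1.53
thT, th2, th4 = 0.332, 1.45, 0.244
th2d, th2dd = 3.0, -1.5

# Equations [17] and [18]
th4d = th2d * r2 * math.cos(th2 - thT) / (r4 * math.cos(th4 - thT))
r3d = r2 * th2d * math.sin(th2 - th4) / math.cos(thT - th4)

# Coriolis-type term of Equation [20] and follower acceleration, Equation [21]
coriolis = 2.0 * r3d * (th2d + r3d / (2.0 * rb2))
th4dd = th2dd * th4d / th2d + (
    r4 * th4d ** 2 * math.sin(th4 - thT)
    - r2 * th2d ** 2 * math.sin(th2 - thT)
    + coriolis
) / (r4 * math.cos(th4 - thT))

# Sliding acceleration from the tangential component of Equation [20]
r3dd = (r2 * th2d ** 2 * math.cos(th2 - thT) + r2 * th2dd * math.sin(th2 - thT)
        - r4 * th4d ** 2 * math.cos(th4 - thT) - r4 * th4dd * math.sin(th4 - thT))

# Residual of the full complex acceleration equation
lhs = (-r4 * th4d ** 2 + 1j * r4 * th4dd) * cmath.exp(1j * th4)
rhs = ((-r2 * th2d ** 2 + 1j * r2 * th2dd) * cmath.exp(1j * th2)
       + (r3dd + 1j * coriolis) * cmath.exp(1j * thT))
residual = lhs - rhs
```

The computed `th4dd` agrees with the paper's 2.46 rad/sec² to within the rounding of the hand calculation.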


BIBLIOGRAPHY 


1 "Kinematische Getriebesynthese," by R. Beyer, Springer-Verlag,
Berlin, Germany, 1953, pp. 189-191.

2 "Syntheses of Four-Link Mechanisms," by Z. S. Bloch, Izvestiya
Academiyi Nauk USSR Otdeleniye Technicheskikh Nauk, 1940,
no. 1, pp. 47-54.

3 "Gelenkvierecke mit vorgeschriebenen Groesst- und Kleinstwerten
der Abtriebswinkelgeschwindigkeit," by N. Rosenauer,
Werkstattstechnik, vol. 38, 1944, pp. 25-27.

4 "A Brief Account of Modern Kinematics," by A. E. R. de
Jonge, Trans. ASME, vol. 65, 1943, pp. 663-683.

5 "Ein neues Verfahren zur Aufsuchung des Krümmungskreises,"
by W. Hartmann, Zeitschrift des Vereines deutscher Ingenieure,
vol. 37, 1893, pp. 95-101.



Discussion 


Wm. J. Carter.⁵ Velocity and acceleration analysis of ordinary
four-bar mechanisms by the complex number vector method is
not new, as the authors state. The writer has used this method
for velocity and acceleration analysis and also for the determination
of higher time derivatives of motion. The writer shares the
authors' opinions about the advantages and disadvantages of this
method. The wider usage of digital computing equipment
should make this method more attractive.

The authors are to be commended for the extension of the com- 
plex number vector method to the analysis of direct contact 
mechanisms. This feature is new so far as the writer is aware. 

It is of interest to note that the authors' Equation [17] may be
established from the known properties of the pitch point P.
Let DK and OH be the lengths of the perpendiculars from D and
O respectively to the line N-N. It may be seen that

$$\frac{OH}{DK} = \frac{r_2\cos(\theta_2 - \theta_T)}{r_4\cos(\theta_4 - \theta_T)} = \frac{OP}{DP} = \frac{\dot\theta_4}{\dot\theta_2}$$

or

$$\dot\theta_4 = \dot\theta_2\,\frac{r_2\cos(\theta_2 - \theta_T)}{r_4\cos(\theta_4 - \theta_T)}$$

which is the same as the authors' Equation [17]. One might now
obtain an expression for $\ddot\theta_4$ by differentiation of Equation [17]
with respect to time. This would, of course, lead to the same result
as that given by the authors, although the physical interpretation
would now involve the time derivatives of DP and OP.
Koenig⁶ has used this method for the analysis of the ordinary
four-bar linkage.

Allen H. Candee, in his discussion of a recent paper by
Freudenstein,⁷ has analyzed the direct contact problem by a method
which is slightly different from that of the authors or the method of
Koenig.

It should be noted that the vector $r_3 = 0$ in Fig. 6 may not be
considered as forming an equivalent four-bar linkage with $r_2$ and
$r_4$ in the normal sense. If such were the case the crossing of T-T
and DO would be the pitch point, which is obviously incorrect.


FERDINAND FREUDENSTEIN.⁸ The introduction of complex
variable techniques in kinematics in 1940 by S. Sh. Blokh came
about as a result of investigations in kinematic synthesis, which
field constitutes the major portion of modern kinematics. Up to
the present there has been no major effort involving complex
variables in mechanism synthesis work in this country, although
the potentialities of the method are great. To the extent that the
authors, while concerned with kinematic analysis, focus attention
on the use of complex variables, they have performed a service to
the profession.

The basic development (i.e., the Introduction) represents no
new ideas or principles as is evident from a perusal of pp. 189-195
of R. Beyer's "Kinematische Getriebesynthese" (authors' ref. 1),
as the authors well know, but the derivations using complex
variables, which follow, have not previously appeared in Western
European or American technical literature as far as is known to


5 Associate Professor, Department of Mechanical Engineering, The
University of Texas, Austin, Texas. Mem. ASME.

6 "A Uniform Method for Determining Angular Accelerations," by
L. R. Koenig, Journal of Applied Mechanics, Trans. ASME, vol. 68,
1946, pp. A-41-44.

7 "On the Maximum and Minimum Velocities and the Accelerations
in Four-Link Mechanisms," by F. Freudenstein, Trans. ASME, vol.
78, 1956, pp. 779-787.

8 Assistant Professor, Department of Mechanical Engineering, 
Columbia University, New York, N. Y. Assoc. Mem. ASME. 




this discusser. Some of the velocity equations can be read off at 
sight by considering the equivalent four-bar linkage and using the 
well-known theorem relating the angular velocity ratio to the ratio 
of the perpendiculars drawn from the cranks to intersect the con- 
necting link. Considerable relevant work along the same lines 
can be found in the writings of S. Sh. Blokh and these have been 
reviewed in the Applied Mechanics Reviews. Some specific com- 
ments follow: 

(a) The description of references (1) and (3) leaves room for 
improvement. Beyer (1) describes an application of complex 
variables to the synthesis of four-bar linkages having prescribed 
values of the angular velocities and angular accelerations of the 
links. Rosenauer (3) describes an extension of this technique to 
the synthesis of linkages having prescribed extreme values of the 
angular velocity ratio of driving and driven links. 

(b) In using the equations developed by the authors, it may be 
found desirable to summarize the equations in easy-to-use tabular 
form. This would permit the use of modern computational facili- 
ties. 

(c) A method used by S. Sh. Blokh and more recently also by
K. H. Sieker utilizes complex conjugates to eliminate unwanted
unknowns. In a four-bar linkage, for instance, usually only the
displacement, $\theta_2$, of the driving crank and the lengths of the links
are known; in deriving the angular velocity and acceleration of
the driven link it would be convenient, therefore, if no quantities
other than these were to enter the equations. This can be done
as follows:

Let $r_1 = OC$, Fig. 4. Chain closure is expressed, therefore, by
the equation

$$r_2 e^{i\theta_2} + r_3 e^{i\theta_3} + r_4 e^{i\theta_4} = r_1$$

We can write in addition the complex conjugate form of this equation,
which applies to the mechanism reflected about the fixed
link, OC

$$r_2 e^{-i\theta_2} + r_3 e^{-i\theta_3} + r_4 e^{-i\theta_4} = r_1$$

We can eliminate "unwanted" $\theta_3$ from these equations, obtaining

$$r_3^2 = r_1^2 + r_2^2 + r_4^2 - 2 r_1 r_2\cos\theta_2 - 2 r_1 r_4\cos\theta_4 + 2 r_2 r_4\cos(\theta_2 - \theta_4)$$

Upon differentiation of this equation, it can be shown that

$$\frac{\dot\theta_4}{\dot\theta_2} = \frac{r_2\,[\,r_4\sin(\theta_2 - \theta_4) - r_1\sin\theta_2\,]}{r_4\,[\,r_2\sin(\theta_2 - \theta_4) + r_1\sin\theta_4\,]}$$

$$\ddot\theta_4 = \frac{\dot\theta_4}{\dot\theta_2}\,\ddot\theta_2 - \frac{r_1 r_2\dot\theta_2^2\cos\theta_2 + r_1 r_4\dot\theta_4^2\cos\theta_4 - r_2 r_4(\dot\theta_2 - \dot\theta_4)^2\cos(\theta_2 - \theta_4)}{r_4\,[\,r_2\sin(\theta_2 - \theta_4) + r_1\sin\theta_4\,]}$$

Of course these equations still include $\theta_4$.

These equations should be compared with Equations [8] and
[12] of the authors. This discusser respectfully suggests that the
authors consider the possibilities of this technique in their future
work, especially in the field of kinematic synthesis.
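The conjugate elimination can be checked numerically. The following is a minimal sketch under stated assumptions: the linkage is built by choosing hypothetical values of $r_1$, $r_2$, $\theta_2$, $r_3$, $\theta_3$ and closing the chain to obtain $r_4$ and $\theta_4$; the velocity ratio obtained by differentiating the eliminated equation is then compared with the form of the authors' Equation [8].

```python
import cmath
import math

# Hypothetical linkage: choose r1, r2, th2, r3, th3 and close the chain
r1, r2, th2, r3, th3 = 3.0, 1.0, 1.0, 2.5, 0.2
link4 = r1 - r2 * cmath.exp(1j * th2) - r3 * cmath.exp(1j * th3)
r4, th4 = abs(link4), cmath.phase(link4)

# Product of the closure equation and its conjugate, each solved for the r3 term
lhs = r3 ** 2
rhs = (r1 ** 2 + r2 ** 2 + r4 ** 2
       - 2 * r1 * r2 * math.cos(th2)
       - 2 * r1 * r4 * math.cos(th4)
       + 2 * r2 * r4 * math.cos(th2 - th4))

# Velocity ratio th4d/th2d from differentiating the eliminated equation
ratio_conjugate = (r2 * (r4 * math.sin(th2 - th4) - r1 * math.sin(th2))
                   / (r4 * (r2 * math.sin(th2 - th4) + r1 * math.sin(th4))))

# Velocity ratio in the form of Equation [8], which still needs th3
ratio_eq8 = r2 * math.sin(th2 - th3) / (r4 * math.sin(th3 - th4))
```

The two ratios agree, but the first involves only $\theta_2$, $\theta_4$, and the link lengths.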


A. S. Hall, Jr.⁹ The increasing availability of digital com-
puting equipment makes it profitable to re-examine our ways of 
doing things in many fields of engineering. We have barely 
touched upon the possibilities for using such equipment in 
kinematic analysis and synthesis. The authors have performed 


9 Professor, School of Mechanical Engineering, Purdue University,
Lafayette, Ind. Mem. ASME. 




a service in spelling out an approach to writing velocity and ac- 
celeration equations which can easily be programmed for auto- 
matic computation. If we are to go in for automatic computa- 
tion of velocities and accelerations, then we may as well include 
position. The authors have assumed position data known in all 
their example problems, perhaps feeling this problem to be trivial. 

In connection with the example on direct contact mechanisms
perhaps a word might be added on the subject of "equivalent
linkages." It is implied in the paper (as well as in most textbooks)
that there is one equivalent linkage. Actually there exists
an infinite number of four-bar linkages equivalent, insofar as
instantaneous velocities and accelerations are concerned, to a given
direct contact mechanism. To form such an equivalent four-bar
we could choose, quite arbitrarily, a point M on body 2 (see Fig. 6
of the paper) to be the location of the pin joint between 2 and the
connecting-rod of the equivalent four-bar. Then, using the
Euler-Savary equation, we could solve for the location N of the
center of curvature of the path which M traces in the motion of 2
relative to 4. If we then let N be the location of the pin joint
between 4 and the connecting-rod we shall have our equivalent
mechanism.

The authors have actually found one such equivalent mecha- 
nism in their example problem, Fig. 6. It is formed by pinning a 
connecting-rod between B on body 4 and G on body 2. 

This somewhat larger view of the equivalent linkage does make 
it possible to reduce the direct contact problem to the four-bar 
problem, even for the case in which one of the direct contact 
bodies has infinite radius of curvature. 


R. T. Hinkle.¹⁰ The authors state that the theory presented
in this paper for the determination of velocities and accelerations
is not new. However, they have generalized it by extending it to
include direct contact and complex mechanisms. Since other
analytical methods for determining acceleration become so
cumbersome when applied to complex mechanisms as to be almost
useless, this paper is a major contribution.

The graphical method will probably remain in wide use for 
velocity and acceleration analysis, but when synthesis is con- 
sidered, some form of analytical approach is needed. It may be 
that this paper will have its greatest value in the development of 
this field. At the present time, one investigator is applying the 
theory presented here for the determination of inertia stresses in 
mechanisms.




AUTHORS' CLOSURE


Professor Carter points out that Equation [17], which applies
to direct contact mechanisms, may also be established from the
known properties of the pitch point P in Fig. 6. Further, he
states that this equation may be differentiated to obtain an
expression for $\ddot\theta_4$, a method which Koenig has used in an analysis
of the ordinary four-bar linkage. This is true. However, if
we were to follow this suggestion we would find that the resulting
expression obtained for $\ddot\theta_4$ would contain the quantities $(\dot\theta_2 - \dot\theta_T)$
and $(\dot\theta_4 - \dot\theta_T)$. The values of both of these quantities are
unknown of course because $\dot\theta_T$ is not known. Thus it appears that
such a procedure would not provide a solution for $\ddot\theta_4$.

10 Professor of Mechanical Engineering, Michigan State University,
East Lansing, Mich. Mem. ASME.

In commenting on $r_3 = 0$ in Fig. 6, Professor Carter warns
that this is not to be regarded as link 3 in an equivalent four-bar
linkage. True, the authors have not used $r_3$ as such and
appreciate that this discusser's comment should help further in
preventing any misconception of this kind by the reader.

Professor Freudenstein states that some of the velocity equa- 
tions can be read off at sight by considering the equivalent 
four-bar linkage and using the well-known theorem relating the 
angular velocity ratio to the ratio of the perpendiculars drawn 
from the cranks to intersect the connecting link. This is the 
same comment as is made by Professor Carter. The authors 
have been aware of this method, but throughout the paper the 
approach to both velocity and acceleration analysis has been 
by means of summation of relative velocity and acceleration 
vectors. This has been done to demonstrate the application of 
the general expressions presented in the Introduction. 

The use of complex conjugates as pointed out by Professor
Freudenstein for eliminating unknown $\theta_3$ in the four-bar linkage
is indeed a further gain even though his equations for $\dot\theta_4$ and
$\ddot\theta_4$ contain more terms than the authors' Equations [8] and [12].

The comments by Professor Hall as to the desirability of
expressing position in a form which would be convenient for
computational purposes, as the authors have done for velocity
and acceleration, are well placed. The authors' equations give
the velocity and acceleration of any link in the mechanism in
terms of the velocity and acceleration of the driving link alone,
and it has been shown how the unknown velocities and accelerations
of the other links in the kinematic chain can be eliminated.
However, using exponentials, no means has been found to
eliminate a sufficient number of unknown angles so that the
position of any one link can be expressed in terms of the lengths
of the links and the angular position of the driver alone. The
method of using complex conjugates, as cited by Professor
Freudenstein in his discussion, eliminates "unwanted" $\theta_3$ from
the position vector equation for the four-bar linkage. But, as
he mentions, unwanted $\theta_4$ still remains. The method commonly
used in finding the angular positions of the links has been to
use ordinary trigonometry. We know for example that in
relating angles and lengths in the four-bar linkage it has been
customary to work with the diagonal of the linkage. This is
the method the authors have used after devoting considerable
effort to find more efficient methods.
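The diagonal method mentioned above can be sketched as follows. This is one possible formulation, not necessarily the authors' working: it assumes the chain closure $r_2 e^{i\theta_2} + r_3 e^{i\theta_3} + r_4 e^{i\theta_4} = r_1$ with the fixed link OC along the real axis, uses the diagonal AC and the law of cosines, and the function name and the `branch` parameter (which selects one of the two possible assemblies) are hypothetical.

```python
import cmath
import math

def four_bar_position(r1, r2, r3, r4, th2, branch=1):
    """Return (th3, th4) for a given crank angle th2 using the diagonal AC."""
    d = r1 - r2 * cmath.exp(1j * th2)  # diagonal from A to C
    cos_phi = (r3 ** 2 + abs(d) ** 2 - r4 ** 2) / (2.0 * r3 * abs(d))
    if abs(cos_phi) > 1.0:
        raise ValueError("linkage cannot be assembled at this crank angle")
    th3 = cmath.phase(d) + branch * math.acos(cos_phi)  # angle of AB
    bc = d - r3 * cmath.exp(1j * th3)                   # vector from B to C
    return th3, cmath.phase(bc)

# Hypothetical link lengths and crank angle
r1, r2, r3, r4, th2 = 4.0, 1.5, 4.0, 3.0, 1.0
th3, th4 = four_bar_position(r1, r2, r3, r4, th2)

closure = (r2 * cmath.exp(1j * th2) + r3 * cmath.exp(1j * th3)
           + r4 * cmath.exp(1j * th4) - r1)
```

Repeating the call for a series of crank angles gives the position data needed by the velocity and acceleration equations.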

The comment by Professor Hall that there is an infinite number 
of equivalent four-bar linkages for a direct-contact mechanism 
should be emphasized indeed. Professor Hall has explained how 
we can obtain equivalent four-bar linkages where all links are of 
finite length even for cams having flat-faced followers. Thus 
in addition to the solution of velocities and accelerations for 
direct-contact mechanisms as expressed by Equations [17] 
and [21], any direct-contact mechanism may be analyzed using 
the four-bar-linkage solution as expressed by Equations [7], 
[8], and [12]. 

In conclusion, the authors wish to thank Professors Carter, 
Freudenstein, Hall, and Hinkle for their contributions and for 
the trouble they have taken in discussing the paper. 



